Safe human-robot collaboration (HRC) has recently gained a lot of interest with the emerging Industry 5.0 paradigm. Conventional robots are being replaced with more intelligent and flexible collaborative robots (cobots). Safe and efficient collaboration between cobots and humans largely relies on the cobot's comprehensive semantic understanding of the dynamic surrounding of industrial environments. Despite the importance of semantic understanding for such applications, 3D semantic segmentation of collaborative robot workspaces lacks sufficient research and dedicated datasets. The performance limitation caused by insufficient datasets is called 'data hunger' problem. To overcome this current limitation, this work develops a new dataset specifically designed for this use case, named "COVERED", which includes point-wise annotated point clouds of a robotic cell. Lastly, we also provide a benchmark of current state-of-the-art (SOTA) algorithm performance on the dataset and demonstrate a real-time semantic segmentation of a collaborative robot workspace using a multi-LiDAR system. The promising results from using the trained Deep Networks on a real-time dynamically changing situation shows that we are on the right track. Our perception pipeline achieves 20Hz throughput with a prediction point accuracy of $>$96\% and $>$92\% mean intersection over union (mIOU) while maintaining an 8Hz throughput.
翻译:安全的人机协作(HRC)随着新兴的工业5.0范式近年受到广泛关注。传统机器人正被更智能、更灵活的协作机器人所取代。机器人与人类之间安全高效的协作很大程度上依赖于机器人对工业环境动态场景的全面语义理解。尽管语义理解对此类应用至关重要,但协作机器人工作空间的3D语义分割仍缺乏充分的研究和专用数据集。因数据集不足导致的性能受限问题被称为"数据饥饿"问题。为克服这一局限,本研究专门针对该应用场景开发了名为"COVERED"的新数据集,其中包含机器人工作单元的逐点标注点云数据。最后,我们还在该数据集上提供了当前最先进算法的性能基准,并展示了使用多激光雷达系统对协作机器人工作空间进行实时语义分割的结果。通过在实时动态变化场景中应用训练后的深度网络所取得的令人鼓舞的结果,表明我们正沿着正确方向前进。我们的感知流水线实现了20Hz的吞吐量,预测点精度超过96%,平均交并比超过92%,同时保持8Hz的吞吐量。