With recent advances in autonomous driving technology, the growing pursuit of efficiency in repetitive tasks, and the rising value of contactless services, mobile service robots such as delivery and serving robots are attracting attention, and demand for them is increasing. However, when something goes wrong, most commercial serving robots must return to their starting position and orientation before they can operate normally again. In this paper, we address this problem through end-to-end relocalization of serving robots, that is, predicting the robot pose directly from onboard sensor data alone using neural networks. In particular, we propose a deep neural network architecture for relocalization based on camera and 2D LiDAR sensor fusion, which we call FusionLoc. In the proposed method, multi-head self-attention lets the different types of information captured by the two sensors complement each other. Our experiments on a dataset collected by a commercial serving robot demonstrate that FusionLoc achieves better performance than previous relocalization methods that take only a single image or a 2D LiDAR point cloud as input, as well as a straightforward fusion method that concatenates their features.
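As a rough illustration of the fusion idea described above, the following sketch treats one image feature vector and one 2D LiDAR feature vector as a two-token sequence, applies multi-head self-attention so that each token can attend to the other, and maps the result to a pose. It is not the FusionLoc architecture itself: the feature dimension, the number of heads, the random untrained weights, the stand-in encoder outputs, and the (x, y, yaw) pose head are all assumptions made for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def multi_head_self_attention(tokens, w_q, w_k, w_v, w_o, num_heads):
    """Scaled dot-product self-attention over tokens of shape (n_tokens, d_model).

    Returns the attended tokens (n_tokens, d_model) and the attention
    weights (num_heads, n_tokens, n_tokens).
    """
    n, d = tokens.shape
    d_head = d // num_heads

    def split(x):
        # (n, d_model) -> (num_heads, n, d_head)
        return x.reshape(n, num_heads, d_head).transpose(1, 0, 2)

    q, k, v = split(tokens @ w_q), split(tokens @ w_k), split(tokens @ w_v)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)  # (heads, n, n)
    attn = softmax(scores, axis=-1)
    heads = attn @ v  # (heads, n, d_head)
    merged = heads.transpose(1, 0, 2).reshape(n, d)  # concatenate the heads
    return merged @ w_o, attn


rng = np.random.default_rng(0)
d_model, num_heads = 16, 4  # assumed sizes, chosen only for the example

# Hypothetical stand-ins for the outputs of an image encoder and a LiDAR encoder.
image_feat = rng.standard_normal(d_model)
lidar_feat = rng.standard_normal(d_model)
tokens = np.stack([image_feat, lidar_feat])  # (2, d_model)

# Random, untrained projection weights.
w_q, w_k, w_v, w_o = (rng.standard_normal((d_model, d_model)) * 0.1 for _ in range(4))
fused, attn = multi_head_self_attention(tokens, w_q, w_k, w_v, w_o, num_heads)

# Assumed pose head: a linear map from the flattened fused tokens to (x, y, yaw).
w_pose = rng.standard_normal((2 * d_model, 3)) * 0.1
pose = fused.reshape(-1) @ w_pose
```

By contrast, the concatenation baseline mentioned in the abstract would pass `np.concatenate([image_feat, lidar_feat])` to the pose head directly, with no interaction between the two modalities before regression.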