For the best human-robot interaction experience, the robot's navigation policy should take into account personal preferences of the user. In this paper, we present a learning framework complemented by a perception pipeline to train a depth vision-based, personalized navigation controller from user demonstrations. Our virtual reality interface enables the demonstration of robot navigation trajectories under motion of the user for dynamic interaction scenarios. The novel perception pipeline enrolls a variational autoencoder in combination with a motion predictor. It compresses the perceived depth images to a latent state representation to enable efficient reasoning of the learning agent about the robot's dynamic environment. In a detailed analysis and ablation study, we evaluate different configurations of the perception pipeline. To further quantify the navigation controller's quality of personalization, we develop and apply a novel metric to measure preference reflection based on the Fr\'echet Distance. We discuss the robot's navigation performance in various virtual scenes and demonstrate the first personalized robot navigation controller that solely relies on depth images. A supplemental video highlighting our approach is available online.
翻译:为获得最佳人机交互体验,机器人的导航策略应充分考虑用户的个人偏好。本文提出一种基于感知管线的学习框架,通过用户演示训练基于深度视觉的个性化导航控制器。我们的虚拟现实接口使用户能够在动态交互场景中演示随自身运动产生的机器人导航轨迹。该新型感知管线采用变分自编码器与运动预测器相结合的方式,将采集的深度图像压缩为潜在状态表征,使学习智能体能够高效推理机器人所处的动态环境。通过详细分析与消融研究,我们评估了感知管线的不同配置。为量化导航控制器的个性化质量,我们基于弗雷歇距离(Fréchet Distance)提出并应用了一种新型偏好反射度量指标。我们讨论了机器人在多种虚拟场景中的导航性能,并首次展示了仅依赖深度图像的个性化机器人导航控制器。本方法的补充视频已在线发布。