The ability to anticipate others' goals and intentions lies at the basis of human-human social interaction. This ability, largely based on non-verbal communication, is also key to natural and pleasant interactions with artificial agents such as robots. In this work, we discuss a preliminary experiment on the use of head pose as a visual cue to understand and anticipate action goals, particularly for reaching and transporting movements. By reasoning on the spatio-temporal connections between the head, hands, and objects in the scene, we show that short-range anticipation is possible, laying the foundations for future applications to human-robot interaction.