This paper presents a novel concept for intuitive end-user programming of robots, inspired by natural interaction between humans. Natural language and supportive gestures are translated into robot programs using large language models (LLMs) and computer vision (CV). Through equally natural system feedback in the form of clarification questions and visual representations, the generated program can be reviewed and adjusted, thereby ensuring safety, transparency, and user acceptance.