Exploring Large Language Models to Facilitate Variable Autonomy for Human-Robot Teaming

In a rapidly evolving digital landscape autonomous tools and robots are becoming commonplace. Recognizing the significance of this development, this paper explores the integration of Large Language Models (LLMs) like Generative pre-trained transformer (GPT) into human-robot teaming environments to facilitate variable autonomy through the means of verbal human-robot communication. In this paper, we introduce a novel framework for such a GPT-powered multi-robot testbed environment, based on a Unity Virtual Reality (VR) setting. This system allows users to interact with robot agents through natural language, each powered by individual GPT cores. By means of OpenAI's function calling, we bridge the gap between unstructured natural language input and structure robot actions. A user study with 12 participants explores the effectiveness of GPT-4 and, more importantly, user strategies when being given the opportunity to converse in natural language within a multi-robot environment. Our findings suggest that users may have preconceived expectations on how to converse with robots and seldom try to explore the actual language and cognitive capabilities of their robot collaborators. Still, those users who did explore where able to benefit from a much more natural flow of communication and human-like back-and-forth. We provide a set of lessons learned for future research and technical implementations of similar systems.

翻译：在快速演变的数字环境中，自主工具和机器人正日益普及。认识到这一发展的重要性，本文探讨了将生成式预训练变换器（GPT）等大型语言模型（LLMs）集成到人机协作环境中，通过言语人机通信实现可变自主性。本文介绍了一种基于Unity虚拟现实（VR）环境的新型框架，用于此类由GPT驱动的多机器人测试平台。该系统允许用户通过自然语言与机器人代理交互，每个代理均由独立的GPT核心驱动。借助OpenAI的函数调用功能，我们弥合了非结构化自然语言输入与结构化机器人动作之间的鸿沟。一项包含12名参与者的用户研究探讨了GPT-4的有效性，更重要的是，研究了用户在多机器人环境中获得自然语言对话机会时所采取的策略。我们的研究结果表明，用户可能对如何与机器人对话存在先入为主的期望，很少尝试探索机器人协作者的实际语言和认知能力。尽管如此，那些进行探索的用户能够受益于更自然的沟通流程和类人化的双向交互。我们为未来类似系统的研究和技术实现提供了一系列经验教训。

相关内容

大语言模型

关注 66

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日