This paper presents Swarm-GPT, a system that integrates large language models (LLMs) with safe swarm motion planning - offering an automated and novel approach to deployable drone swarm choreography. Swarm-GPT enables users to automatically generate synchronized drone performances through natural language instructions. With an emphasis on safety and creativity, Swarm-GPT addresses a critical gap in the field of drone choreography by integrating the creative power of generative models with the effectiveness and safety of model-based planning algorithms. This goal is achieved by prompting the LLM to generate a unique set of waypoints based on extracted audio data. A trajectory planner processes these waypoints to guarantee collision-free and feasible motion. Results can be viewed in simulation prior to execution and modified through dynamic re-prompting. Sim-to-real transfer experiments demonstrate Swarm-GPT's ability to accurately replicate simulated drone trajectories, with a mean sim-to-real root mean square error (RMSE) of 28.7 mm. To date, Swarm-GPT has been successfully showcased at three live events, exemplifying safe real-world deployment of pre-trained models.
翻译:本文提出Swarm-GPT系统,该系统将大语言模型(LLMs)与安全集群运动规划相结合,为可部署无人机集群编舞提供了一种自动化且新颖的方法。Swarm-GPT使用户能够通过自然语言指令自动生成同步的无人机表演。在强调安全性与创造性的前提下,Swarm-GPT通过将生成模型的创意能力与基于模型的规划算法的有效性和安全性相结合,弥补了无人机编舞领域的关键空白。这一目标通过提示LLM基于提取的音频数据生成一组独特航路点来实现,轨迹规划器则对这些航路点进行处理以确保无碰撞且可行的运动。结果可在执行前通过仿真查看,并可通过动态重提示进行修改。仿真到真实世界的迁移实验表明,Swarm-GPT能够精确复现模拟无人机轨迹,其平均仿真到真实均方根误差(RMSE)为28.7毫米。迄今为止,Swarm-GPT已在三场现场活动中成功展示,例证了预训练模型在真实世界中的安全部署。