Legged robots navigating cluttered environments must be jointly agile for efficient task execution and safe to avoid collisions with obstacles or humans. Existing studies either develop conservative controllers (< 1.0 m/s) to ensure safety, or focus on agility without considering potentially fatal collisions. This paper introduces Agile But Safe (ABS), a learning-based control framework that enables agile and collision-free locomotion for quadrupedal robots. ABS involves an agile policy to execute agile motor skills amidst obstacles and a recovery policy to prevent failures, collaboratively achieving high-speed and collision-free navigation. The policy switch in ABS is governed by a learned control-theoretic reach-avoid value network, which also guides the recovery policy as an objective function, thereby safeguarding the robot in a closed loop. The training process involves the learning of the agile policy, the reach-avoid value network, the recovery policy, and an exteroception representation network, all in simulation. These trained modules can be directly deployed in the real world with onboard sensing and computation, leading to high-speed and collision-free navigation in confined indoor and outdoor spaces with both static and dynamic obstacles.
翻译:腿式机器人在复杂环境中导航时,既需要具备敏捷性以实现高效任务执行,又需要保证安全性以避免与障碍物或人类碰撞。现有研究要么开发保守控制器(速度<1.0米/秒)确保安全,要么专注于敏捷性而未考虑可能致命的碰撞。本文提出“敏捷但安全”(Agile But Safe, ABS)这一基于学习的控制框架,使四足机器人能够实现敏捷且无碰撞的运动。ABS包含一个敏捷策略,用于在障碍物间执行敏捷运动技能,以及一个恢复策略,用于防止故障,两者协同实现高速无碰撞导航。ABS中的策略切换由学习得到的控制理论“避碰-可达”值网络控制,该网络还作为目标函数指导恢复策略,从而在闭环中保护机器人安全。训练过程涉及敏捷策略、避碰-可达值网络、恢复策略及外部感知表征网络的学习,均在仿真环境中完成。这些训练好的模块可直接部署到现实世界,通过机载传感与计算,在包含静态和动态障碍物的狭窄室内外空间中实现高速无碰撞导航。