Recently, quadrupedal locomotion has achieved significant success, but their manipulation capabilities, particularly in handling large objects, remain limited, restricting their usefulness in demanding real-world applications such as search and rescue, construction, industrial automation, and room organization. This paper tackles the task of obstacle-aware, long-horizon pushing by multiple quadrupedal robots. We propose a hierarchical multi-agent reinforcement learning framework with three levels of control. The high-level controller integrates an RRT planner and a centralized adaptive policy to generate subgoals, while the mid-level controller uses a decentralized goal-conditioned policy to guide the robots toward these sub-goals. A pre-trained low-level locomotion policy executes the movement commands. We evaluate our method against several baselines in simulation, demonstrating significant improvements over baseline approaches, with 36.0% higher success rates and 24.5% reduction in completion time than the best baseline. Our framework successfully enables long-horizon, obstacle-aware manipulation tasks like Push-Cuboid and Push-T on Go1 robots in the real world.
翻译:近年来,四足机器人的运动控制已取得显著进展,但其操作能力,尤其是在处理大型物体方面,仍存在局限,这限制了它们在搜救、建筑施工、工业自动化及室内整理等实际高要求场景中的应用。本文研究了多台四足机器人在障碍物环境下的长时程推进任务。我们提出了一种分层多智能体强化学习框架,包含三层控制结构:高层控制器融合RRT规划器与集中式自适应策略以生成子目标;中层控制器采用分散式目标条件策略引导机器人朝向这些子目标运动;预训练的低层运动策略则负责执行具体的移动指令。我们在仿真环境中将所提方法与多种基线方法进行比较评估,结果表明本方法较基线有显著提升,成功率比最佳基线提高36.0%,任务完成时间减少24.5%。该框架成功在现实世界的Go1机器人上实现了如立方体推进与T型物推进等长时程、障碍物感知的操作任务。