Recently, reinforcement learning has become a promising and polular solution for robot legged locomotion. Compared to model-based control, reinforcement learning based controllers can achieve better robustness against uncertainties of environments through sim-to-real learning. However, the corresponding learned gaits are in general overly conservative and unatural. In this paper, we propose a new framework for learning robust, agile and natural legged locomotion skills over challenging terrain. We incorporate an adversarial training branch based on real animal locomotion data upon a teacher-student training pipeline for robust sim-to-real transfer. Empirical results on both simulation and real world of a quadruped robot demonstrate that our proposed algorithm enables robustly traversing challenging terrains such as stairs, rocky ground and slippery floor with only proprioceptive perception. Meanwhile, the gaits are more agile, natural, and energy efficient compared to the baselines. Both qualitative and quantitative results are presented in this paper.
翻译:近年来,强化学习已成为机器人腿部运动领域一种有前景且流行的解决方案。与基于模型的控制相比,通过仿真到现实学习,基于强化学习的控制器能够更好地应对环境不确定性带来的挑战,实现更强的鲁棒性。然而,相应学习到的步态通常过于保守且不自然。本文提出了一种新框架,用于在复杂地形上学习鲁棒、敏捷且自然的腿部运动技能。我们在师生训练流程的基础上,融入基于真实动物运动数据的对抗训练分支,以实现鲁棒的仿真到现实迁移。在四足机器人的仿真和真实世界中的实验结果均表明,仅依靠本体感知,我们提出的算法便能鲁棒地穿越楼梯、岩石地面和光滑地板等具有挑战性的地形。同时,与基线方法相比,其步态更加敏捷、自然且节能。本文展示了定性和定量两方面的实验结果。