Curriculum learning has proven highly effective in robot learning, but it still faces limitations when scaling to complex, wide-ranging task spaces. Such task spaces often lack a well-defined difficulty structure, making the difficulty ordering required by previous methods hard to specify. We propose a Learning Progress-based Automatic Curriculum Reinforcement Learning (LP-ACRL) framework, which estimates the agent's learning progress online and adaptively adjusts the task-sampling distribution, thereby enabling automatic curriculum generation without prior knowledge of the difficulty distribution over the task space. Policies trained with LP-ACRL enable the ANYmal D quadruped to achieve and maintain stable, high-speed locomotion at 2.5 m/s linear velocity and 3.0 rad/s angular velocity across diverse terrains, including stairs, slopes, gravel, and low-friction flat surfaces, whereas previous methods have generally been limited to high speeds on flat terrain or low speeds on complex terrain. Experimental results demonstrate that LP-ACRL exhibits strong scalability and real-world applicability, providing a robust baseline for future research on curriculum generation in complex, wide-ranging robotic learning task spaces.
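The core loop the abstract describes, estimating per-task learning progress online and reshaping the task-sampling distribution accordingly, can be sketched as follows. This is a minimal illustration, not the paper's implementation: the class name, the EMA-difference progress estimate, and all parameter values are assumptions chosen for clarity.

```python
import numpy as np

class LPTaskSampler:
    """Hypothetical sketch of learning-progress-based task sampling.

    Learning progress for each task is approximated as the absolute
    difference between a fast and a slow exponential moving average (EMA)
    of episode returns: tasks whose performance is changing quickly get
    sampled more often. All hyperparameters here are illustrative.
    """

    def __init__(self, n_tasks, alpha_fast=0.3, alpha_slow=0.05,
                 eps_uniform=0.2, seed=0):
        self.fast = np.zeros(n_tasks)   # fast EMA of per-task return
        self.slow = np.zeros(n_tasks)   # slow EMA of per-task return
        self.alpha_fast = alpha_fast
        self.alpha_slow = alpha_slow
        self.eps = eps_uniform          # uniform-mixing floor
        self.rng = np.random.default_rng(seed)

    def update(self, task, episode_return):
        # Online update of both EMAs after an episode on `task`.
        self.fast[task] += self.alpha_fast * (episode_return - self.fast[task])
        self.slow[task] += self.alpha_slow * (episode_return - self.slow[task])

    def probs(self):
        # Learning-progress estimate: |fast EMA - slow EMA| per task.
        lp = np.abs(self.fast - self.slow)
        n = len(lp)
        p = lp / lp.sum() if lp.sum() > 0 else np.full(n, 1.0 / n)
        # Mix with a uniform distribution so no task is starved,
        # preserving exploration over the whole task space.
        return (1.0 - self.eps) * p + self.eps / n

    def sample(self):
        # Draw the next training task from the adaptive distribution.
        return int(self.rng.choice(len(self.fast), p=self.probs()))
```

In a locomotion setting, each "task" could be a bin over commanded velocity and terrain type; the sampler then concentrates training on the velocity/terrain regions where the policy is currently improving fastest, without requiring any predefined difficulty ordering.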