Asymmetric physics enables efficient learning in quadrupedal robot swarms

Yuang Zhang, Yunlong Song, Zhihao He, Zelin Ni, Kangyu Wang, Tianchi Liu, Yu Hu, Feng Yu, Danping Zou, Weiyao Lin

Animal collectives navigate cluttered environments through local coordination, yet robot swarms still struggle to reproduce this capability in the physical world. End-to-end learning offers a route to such coordination, but scaling it to embodied swarms remains difficult: standard sampling-based reinforcement learning becomes inefficient when visual perception, dense robot-robot interaction, and contact-rich locomotion must be learned together. Here we show that asymmetric physics enables efficient end-to-end learning of vision-based, decentralized control in large swarms of quadrupedal robots. During training, quadrupeds interact in shared environments, where a high-fidelity, non-differentiable simulator generates realistic motion and contact dynamics, and differentiable surrogate models provide gradients for navigation and locomotion policies. This separation enables up to 512 quadrupeds to learn coordinated navigation policies in obstacle-rich environments. At deployment, each robot acts from a single forward-facing depth camera, without explicit communication, centralized planning, or global maps. The policies generalize across forests, bridges, enclosures, narrow passages, and mazes, and zero-shot transfer to six physical quadrupeds across five real-world scenarios. The resulting swarms exhibit predictive avoidance, right-side yielding, pausing before bottlenecks, and wall following, showing that asymmetric physics enables efficient training of scalable decentralized control policies for quadrupedal robot swarms.
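The split between a non-differentiable simulator and a differentiable surrogate can be pictured with a toy example. The sketch below is an illustration under stated assumptions, not the authors' implementation: it uses a hypothetical 1-D point robot, a hypothetical black-box step with actuator saturation and a wall, a hand-written surrogate x' = x + dt * u, and a single-gain linear policy. All function names and constants are invented for the example. The forward rollout uses states from the black-box step, while the backward pass uses only the surrogate's Jacobians.

```python
# Toy illustration only: a 1-D point robot, not the paper's simulator or policy.
# Every name and constant here is a hypothetical stand-in.

def black_box_step(x, u, dt=0.1):
    """Stand-in for a non-differentiable simulator: saturated actuation plus a wall."""
    u = max(-1.0, min(1.0, u))      # actuator saturation
    return min(x + dt * u, 2.0)     # hard contact with a wall at x = 2.0

def surrogate_jacobians(dt=0.1):
    """Jacobians of the assumed surrogate x' = x + dt * u: (dx'/dx, dx'/du)."""
    return 1.0, dt

def rollout_loss_and_grad(k, x0=0.0, goal=1.0, steps=20, dt=0.1):
    """Forward pass on the black-box simulator, backward pass through the surrogate."""
    xs = [x0]
    for _ in range(steps):
        u = k * (goal - xs[-1])     # linear policy with a single gain k
        xs.append(black_box_step(xs[-1], u, dt))
    loss = sum((x - goal) ** 2 for x in xs[1:]) / steps

    # Reverse-mode accumulation along the simulated trajectory.
    dfdx, dfdu = surrogate_jacobians(dt)
    grad_k, adj = 0.0, 0.0          # adj accumulates d loss / d x_{t+1}
    for t in reversed(range(steps)):
        adj += 2.0 * (xs[t + 1] - goal) / steps   # direct loss term at x_{t+1}
        err = goal - xs[t]
        grad_k += adj * dfdu * err                # u_t depends on k through k * err
        adj = adj * (dfdx + dfdu * (-k))          # x_{t+1} depends on x_t directly and via u_t
    return loss, grad_k

def train(k=0.1, lr=0.5, iters=50):
    """Plain gradient descent on the policy gain using the surrogate gradient."""
    losses = []
    for _ in range(iters):
        loss, grad = rollout_loss_and_grad(k)
        losses.append(loss)
        k -= lr * grad
    return k, losses

if __name__ == "__main__":
    k, losses = train()
    print(f"gain: {k:.3f}, first loss: {losses[0]:.3f}, last loss: {losses[-1]:.3f}")
```

In this toy setting the surrogate gradient ignores saturation and contact, yet it still points in a direction that lowers the loss measured on the black-box trajectory. Whether and how this carries over to full quadruped dynamics is the subject of the paper, not something the sketch demonstrates.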
