DiffMimic: Efficient Motion Mimicking with Differentiable Physics


Physics simulation · Character animation · Standard basis · Time efficiency · Convergence

April 6, 2023



Jiawei Ren,Cunjun Yu,Siwei Chen,Xiao Ma,Liang Pan,Ziwei Liu

From arXiv; ICLR 2023. Code: https://github.com/jiawei-ren/diffmimic. Project page: https://diffmimic.github.io/

Motion mimicking is a foundational task in physics-based character animation. However, most existing motion mimicking methods are built upon reinforcement learning (RL) and suffer from heavy reward engineering, high variance, and slow convergence caused by hard exploration. Specifically, they usually take tens of hours or even days of training to mimic a simple motion sequence, which results in poor scalability. In this work, we leverage differentiable physics simulators (DPS) and propose an efficient motion mimicking method dubbed DiffMimic. Our key insight is that DPS casts a complex policy learning task as a much simpler state matching problem. In particular, DPS learns a stable policy from analytical gradients with ground-truth physical priors, leading to significantly faster and more stable convergence than RL-based methods. Moreover, to escape from local optima, we use a Demonstration Replay mechanism that enables stable gradient backpropagation over a long horizon. Extensive experiments on standard benchmarks show that DiffMimic has better sample efficiency and time efficiency than existing methods (e.g., DeepMimic). Notably, DiffMimic allows a physically simulated character to learn Backflip after 10 minutes of training and to cycle it after 3 hours of training, while the existing approach may require about a day of training to cycle Backflip. More importantly, we hope DiffMimic can benefit more differentiable animation systems with techniques like differentiable clothes simulation in future research.
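To make the state matching idea concrete, here is a minimal JAX sketch on a toy problem; it is not the authors' implementation. Everything in it is an assumption made for illustration: the 1-D point-mass simulator `step`, the linear feedback `policy`, the helper `make_reference`, the constants, and in particular the replay rule (resetting the simulated state to the reference state once its squared error exceeds `threshold`), which is only one plausible reading of "Demonstration Replay". The point it shows is that the loss is a plain squared error between simulated and reference states, and that its gradient with respect to the policy parameters is obtained analytically by differentiating through the simulator.

```python
import jax
import jax.numpy as jnp

DT = 0.1  # integration step of the toy simulator (assumed value)


def step(state, action):
    """Toy differentiable simulator: 1-D point mass, state = [pos, vel]."""
    vel = state[1] + DT * action
    pos = state[0] + DT * vel
    return jnp.stack([pos, vel])


def make_reference(n_steps):
    """Hypothetical reference motion: the point mass driven by cos(t)."""
    def body(state, t):
        nxt = step(state, jnp.cos(DT * t))
        return nxt, nxt

    init = jnp.zeros(2)
    _, traj = jax.lax.scan(body, init, jnp.arange(n_steps, dtype=jnp.float32))
    return jnp.concatenate([init[None], traj])  # shape (n_steps + 1, 2)


def policy(params, state, ref_next):
    """Linear feedback on the tracking error (illustrative policy form)."""
    return jnp.dot(params, ref_next - state)


def rollout_loss(params, ref, threshold):
    """Mean squared state-matching error over one rollout."""
    def body(state, ref_pair):
        ref_now, ref_next = ref_pair
        # Assumed replay rule: if the simulated state drifted too far from
        # the demonstration, continue the rollout from the reference state.
        err = jnp.sum((state - ref_now) ** 2)
        state = jnp.where(err > threshold, ref_now, state)
        nxt = step(state, policy(params, state, ref_next))
        return nxt, jnp.sum((nxt - ref_next) ** 2)

    _, losses = jax.lax.scan(body, ref[0], (ref[:-1], ref[1:]))
    return jnp.mean(losses)


def train(ref, steps=100, lr=0.1, threshold=0.05):
    """Plain gradient descent using gradients taken through the simulator."""
    params = jnp.zeros(2)
    grad_fn = jax.jit(jax.value_and_grad(rollout_loss))
    loss = jnp.inf
    for _ in range(steps):
        loss, grads = grad_fn(params, ref, threshold)
        params = params - lr * grads
    return params, loss
```

Calling `train(make_reference(50))` runs the loop on a 50-step reference. The learning rate, step count, and threshold are untuned placeholders, so no convergence behaviour is claimed for this toy setup.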

