PHUMA: Physically Reliable Humanoid Locomotion Dataset

Motion imitation is a promising approach for humanoid locomotion, enabling agents to acquire humanlike behaviors. Existing methods typically rely on high-quality motion capture datasets such as AMASS, but these are scarce and expensive, limiting scalability and diversity. Recent studies attempt to scale data collection by converting large-scale internet videos, exemplified by Humanoid-X. However, they often suffer from physical artifacts such as floating, penetration, and foot skating, which hinder stable imitation. To address this, we introduce PHUMA, a Physically Reliable HUMAnoid locomotion dataset produced by a two-stage pipeline combining physics-aware curation and physics-constrained retargeting, aggregating both motion capture and internet video into a physically reliable, 73-hour corpus. On motion tracking benchmarks, PHUMA-trained policies achieve higher success rates than those trained on AMASS and Humanoid-X, and successfully transfer zero-shot to a real Unitree G1. The code is available at https://davian-robotics.github.io/PHUMA.

翻译：摘要：运动模仿是一种很有前景的人形机器人运动生成方法，能使智能体获取类人行为。现有方法通常依赖高质量运动捕捉数据集（如AMASS），但这些数据集稀缺且昂贵，限制了可扩展性和多样性。近期研究尝试通过转换大规模互联网视频来扩展数据收集规模，例如Humanoid-X。然而，这些数据常存在物理伪影（如悬浮、穿透和足部滑动），阻碍了稳定的运动模仿。为解决这一问题，我们提出了PHUMA——一个物理可靠的人形运动数据集，通过两阶段流水线（结合物理感知筛选与物理约束重定向）构建，将运动捕捉数据和互联网视频聚合为73小时的物理可靠语料库。在运动跟踪基准测试中，基于PHUMA训练的策略比基于AMASS和Humanoid-X训练的策略实现了更高的成功率，并成功零样本迁移到真实的Unitree G1机器人上。代码已开源：https://davian-robotics.github.io/PHUMA。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【CMU博士论文】交互驱动的人体动作估计与生成

专知会员服务

18+阅读 · 2025年9月17日

面向机器人操作的基于大型视觉‑语言模型（VLM）的视觉‑语言‑动作（VLA）模型综述

专知会员服务

34+阅读 · 2025年8月19日

【CVPR2025】MixerMDM：可学习的人体运动扩散模型组合

专知会员服务

10+阅读 · 2025年4月3日

【斯坦福博士论文】构建类人化具身智能体：从人类行为中学习

专知会员服务

27+阅读 · 2025年3月20日