PhysWorld：通过物理感知演示合成从真实视频到可变形物体的世界模型 (PhysWorld: From Real Videos to World Models of Deformable Objects via Physics-Aware Demonstration Synthesis)

Interactive world models that simulate object dynamics are crucial for robotics, VR, and AR. However, it remains a significant challenge to learn physics-consistent dynamics models from limited real-world video data, especially for deformable objects with spatially-varying physical properties. To overcome the challenge of data scarcity, we propose PhysWorld, a novel framework that utilizes a simulator to synthesize physically plausible and diverse demonstrations to learn efficient world models. Specifically, we first construct a physics-consistent digital twin within MPM simulator via constitutive model selection and global-to-local optimization of physical properties. Subsequently, we apply part-aware perturbations to the physical properties and generate various motion patterns for the digital twin, synthesizing extensive and diverse demonstrations. Finally, using these demonstrations, we train a lightweight GNN-based world model that is embedded with physical properties. The real video can be used to further refine the physical properties. PhysWorld achieves accurate and fast future predictions for various deformable objects, and also generalizes well to novel interactions. Experiments show that PhysWorld has competitive performance while enabling inference speeds 47 times faster than the recent state-of-the-art method, i.e., PhysTwin.

翻译：交互式世界模型能够模拟物体动力学，对机器人学、虚拟现实和增强现实至关重要。然而，从有限的真实世界视频数据中学习物理一致的动力学模型仍然是一个重大挑战，特别是对于具有空间变化物理属性的可变形物体。为克服数据稀缺的挑战，我们提出了PhysWorld，一个新颖的框架，它利用模拟器合成物理合理且多样化的演示，以学习高效的世界模型。具体而言，我们首先通过本构模型选择和物理属性的全局到局部优化，在MPM模拟器中构建一个物理一致的数字孪生体。随后，我们对物理属性施加部件感知扰动，并为数字孪生体生成各种运动模式，从而合成大量且多样化的演示。最后，利用这些演示，我们训练了一个嵌入物理属性的轻量级基于GNN的世界模型。真实视频可用于进一步细化物理属性。PhysWorld能够对各种可变形物体进行准确且快速的未来预测，并且能很好地泛化到新的交互场景。实验表明，PhysWorld具有有竞争力的性能，同时其推理速度比当前最先进的方法（即PhysTwin）快47倍。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

31+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日