Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments

Chenyu Zhang, Daniil Cherniavskii, Antonios Tragoudaras, Antonios Vozikis, Thijmen Nijdam, Derck W. E. Prinzhorn, Mark Bodracska, Nicu Sebe, Andrii Zadaianchuk, Efstratios Gavves

Recent advances in image and video generation raise hopes that these models possess world modeling capabilities, the ability to generate realistic, physically plausible videos. This could revolutionize applications in robotics, autonomous driving, and scientific simulation. However, before treating these models as world models, we must ask: Do they adhere to physical conservation laws? To answer this, we introduce Morpheus, a benchmark for evaluating video generation models on physical reasoning. It features 80 real-world videos capturing physical phenomena, guided by conservation laws. Since artificial generations lack ground truth, we assess physical plausibility using physics-informed metrics evaluated with respect to infallible conservation laws known per physical setting, leveraging advances in physics-informed neural networks and vision-language foundation models. Our findings reveal that even with advanced prompting and video conditioning, current models struggle to encode physical principles despite generating aesthetically pleasing videos. All data, leaderboard, and code are open-sourced at our project page.
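To make the idea of scoring a video against a conservation law concrete, here is a minimal sketch. It is not the Morpheus metric: the abstract does not specify how its physics-informed metrics are computed, so everything below is an illustrative assumption. It assumes that an object's height (in meters) has already been extracted per frame by some tracker, that gravity is the only force acting, and that the hypothetical functions `energy_per_unit_mass` and `conservation_violation` are adequate stand-ins for a real metric. The score is the largest relative drift of mechanical energy per unit mass over the clip, so a physically plausible trajectory should score near zero.

```python
G = 9.81  # gravitational acceleration in m/s^2 (assumed setting: motion under gravity only)


def energy_per_unit_mass(heights, dt):
    """Return kinetic + potential energy per unit mass (J/kg) at each interior frame.

    Velocity is estimated with central differences, so the first and last
    frames are skipped.
    """
    energies = []
    for i in range(1, len(heights) - 1):
        velocity = (heights[i + 1] - heights[i - 1]) / (2 * dt)
        energies.append(0.5 * velocity * velocity + G * heights[i])
    return energies


def conservation_violation(heights, dt):
    """Return the largest relative deviation of energy from its first value."""
    energies = energy_per_unit_mass(heights, dt)
    reference = energies[0]
    return max(abs(e - reference) for e in energies) / abs(reference)


# Hypothetical tracked heights at 30 frames per second.
dt = 1.0 / 30.0
free_fall = [2.0 - 0.5 * G * (i * dt) ** 2 for i in range(20)]   # obeys energy conservation
constant_drop = [2.0 - 3.0 * (i * dt) for i in range(20)]        # falls at constant speed

print("free fall violation:    ", conservation_violation(free_fall, dt))
print("constant drop violation:", conservation_violation(constant_drop, dt))
```

The constant-speed drop loses potential energy without gaining kinetic energy, so its score is large, while the free-fall trajectory stays at the level of floating-point error. A real pipeline would also need to handle tracking noise, camera calibration, and dissipative effects such as friction, none of which this sketch addresses.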

