PhyPlan: Compositional and Adaptive Physical Task Reasoning with Physics-Informed Skill Networks for Robot Manipulators

Given the task of positioning a ball-like object to a goal region beyond direct reach, humans can often throw, slide, or rebound objects against the wall to attain the goal. However, enabling robots to reason similarly is non-trivial. Existing methods for physical reasoning are data-hungry and struggle with complexity and uncertainty inherent in the real world. This paper presents PhyPlan, a novel physics-informed planning framework that combines physics-informed neural networks (PINNs) with modified Monte Carlo Tree Search (MCTS) to enable embodied agents to perform dynamic physical tasks. PhyPlan leverages PINNs to simulate and predict outcomes of actions in a fast and accurate manner and uses MCTS for planning. It dynamically determines whether to consult a PINN-based simulator (coarse but fast) or engage directly with the actual environment (fine but slow) to determine optimal policy. Evaluation with robots in simulated 3D environments demonstrates the ability of our approach to solve 3D-physical reasoning tasks involving the composition of dynamic skills. Quantitatively, PhyPlan excels in several aspects: (i) it achieves lower regret when learning novel tasks compared to state-of-the-art, (ii) it expedites skill learning and enhances the speed of physical reasoning, (iii) it demonstrates higher data efficiency compared to a physics un-informed approach.

翻译：将球形物体直接放置到目标区域的任务对人类而言可以通过投掷、滑动或利用墙壁反弹物体来实现，但让机器人具备类似的物理推理能力并非易事。现有物理推理方法对数据需求量大，且难以应对真实世界固有的复杂性和不确定性。本文提出PhyPlan——一种新型物理信息规划框架，它结合物理信息神经网络（PINNs）与改进的蒙特卡洛树搜索（MCTS），使具身智能体能够执行动态物理任务。PhyPlan利用PINNs快速准确地模拟和预测动作结果，并通过MCTS进行规划，动态决策是使用基于PINN的模拟器（粗略但快速）还是直接与环境交互（精确但缓慢）以确定最优策略。在模拟三维环境中对机器人进行的评估表明，该方法能解决涉及动态技能组合的3D物理推理任务。定量分析显示，PhyPlan在以下方面表现优异：（i）与现有最优方法相比，学习新任务时的遗憾值更低；（ii）加速技能学习并提升物理推理速度；（iii）数据效率显著高于未引入物理信息的方法。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日