GAMMA: Generalizable Articulation Modeling and Manipulation for Articulated Objects

Articulated objects like cabinets and doors are widespread in daily life. However, directly manipulating 3D articulated objects is challenging because they have diverse geometrical shapes, semantic categories, and kinetic constraints. Prior works mostly focused on recognizing and manipulating articulated objects with specific joint types. They can either estimate the joint parameters or distinguish suitable grasp poses to facilitate trajectory planning. Although these approaches have succeeded in certain types of articulated objects, they lack generalizability to unseen objects, which significantly impedes their application in broader scenarios. In this paper, we propose a novel framework of Generalizable Articulation Modeling and Manipulating for Articulated Objects (GAMMA), which learns both articulation modeling and grasp pose affordance from diverse articulated objects with different categories. In addition, GAMMA adopts adaptive manipulation to iteratively reduce the modeling errors and enhance manipulation performance. We train GAMMA with the PartNet-Mobility dataset and evaluate with comprehensive experiments in SAPIEN simulation and real-world Franka robot. Results show that GAMMA significantly outperforms SOTA articulation modeling and manipulation algorithms in unseen and cross-category articulated objects. We will open-source all codes and datasets in both simulation and real robots for reproduction in the final version. Images and videos are published on the project website at: http://sites.google.com/view/gamma-articulation

翻译：铰接物体（如橱柜和门）在日常生活中广泛存在。然而，直接操控三维铰接物体极具挑战性，因其几何形状、语义类别及动力学约束呈现多样性。现有研究大多集中于识别和操控具有特定关节类型的铰接物体，或通过估计关节参数，或通过区分适用抓取位姿以辅助轨迹规划。尽管这些方法在特定类型的铰接物体上取得了成功，但其对未见物体的泛化能力不足，严重阻碍了在更广泛场景中的应用。本文提出一种新颖的通用铰接建模与操控框架GAMMA，通过学习不同类别铰接物体的铰接建模及抓取位姿可供性，实现泛化。此外，GAMMA采用自适应操控策略，通过迭代建模误差以提升操控性能。我们基于PartNet-Mobility数据集训练GAMMA，并在SAPIEN仿真环境及真实Franka机器人上开展全面实验评估。结果表明，GAMMA在未见及跨类别铰接物体上显著优于现有最先进的铰接建模与操控算法。最终版本中，我们将开源所有仿真及真实机器人代码与数据集以供复现。相关图像与视频已发布于项目网站：http://sites.google.com/view/gamma-articulation

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日