Dynamic grasping of moving objects in complex, continuous motion scenarios remains challenging. Reinforcement Learning (RL) has been applied to various robotic manipulation tasks, benefiting from its closed-loop property. However, existing RL-based methods do not fully exploit the potential of enhanced visual representations. In this letter, we propose a novel framework called Grasps As Points for RL (GAP-RL) to effectively and reliably grasp moving objects. Building on a fast region-based grasp detector, we design a Grasp Encoder that transforms 6D grasp poses into Gaussian points and extracts grasp features as a higher-level abstraction than raw object point features. Additionally, we develop a Graspable Region Explorer for real-world deployment, which searches for consistent graspable regions, enabling smoother grasp generation and stable policy execution. To assess performance fairly, we construct a simulated dynamic grasping benchmark involving objects with various complex motions. Experimental results demonstrate that our method generalizes effectively to novel objects and unseen dynamic motions compared to other baselines. Real-world experiments further validate the framework's sim-to-real transferability.
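To make the "grasps as points" idea concrete, the sketch below shows one generic way a 6D grasp pose (position plus orientation) can be mapped to a small set of 3D points, so that grasps and object point clouds share a unified point-based representation. This is a toy illustration under assumed gripper keypoints (palm center, fingertips, approach point); the paper's actual Gaussian-point encoding and the Grasp Encoder architecture are not reproduced here.

```python
# Toy sketch (assumption, not the paper's implementation): mapping a 6D
# grasp pose to gripper keypoints so a point-based encoder can consume
# grasps alongside object points.
import numpy as np

def quat_to_rotmat(q):
    """Convert a unit quaternion (w, x, y, z) to a 3x3 rotation matrix."""
    w, x, y, z = q
    return np.array([
        [1 - 2*(y*y + z*z), 2*(x*y - w*z),     2*(x*z + w*y)],
        [2*(x*y + w*z),     1 - 2*(x*x + z*z), 2*(y*z - w*x)],
        [2*(x*z - w*y),     2*(y*z + w*x),     1 - 2*(x*x + y*y)],
    ])

def grasp_to_points(position, quaternion, width=0.08, depth=0.04):
    """Map a 6D grasp pose to canonical gripper keypoints in the world frame.

    The keypoint layout and the width/depth defaults are illustrative
    assumptions for a parallel-jaw gripper.
    """
    half = width / 2.0
    keypoints = np.array([
        [0.0,  0.0,   0.0],    # palm center
        [0.0,  half,  depth],  # left fingertip
        [0.0, -half,  depth],  # right fingertip
        [0.0,  0.0,  -depth],  # approach point behind the palm
    ])
    R = quat_to_rotmat(quaternion)
    # Rotate the canonical keypoints into the grasp frame, then translate.
    return keypoints @ R.T + np.asarray(position)

# With the identity rotation, keypoints are simply translated to the grasp position.
pts = grasp_to_points([0.5, 0.0, 0.2], [1.0, 0.0, 0.0, 0.0])
```

In such a representation, each grasp contributes a fixed-size set of points that can be fed to the same point-feature backbone as the observed object cloud, which is the kind of unification the abstract describes at a high level.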