Visual-tactile Fusion for Transparent Object Grasping in Complex Backgrounds

The accurate detection and grasping of transparent objects are challenging but of significance to robots. Here, a visual-tactile fusion framework for transparent object grasping under complex backgrounds and variant light conditions is proposed, including the grasping position detection, tactile calibration, and visual-tactile fusion based classification. First, a multi-scene synthetic grasping dataset generation method with a Gaussian distribution based data annotation is proposed. Besides, a novel grasping network named TGCNN is proposed for grasping position detection, showing good results in both synthetic and real scenes. In tactile calibration, inspired by human grasping, a fully convolutional network based tactile feature extraction method and a central location based adaptive grasping strategy are designed, improving the success rate by 36.7% compared to direct grasping. Furthermore, a visual-tactile fusion method is proposed for transparent objects classification, which improves the classification accuracy by 34%. The proposed framework synergizes the advantages of vision and touch, and greatly improves the grasping efficiency of transparent objects.

翻译：透明物体的精确检测与抓取对机器人而言具有挑战性且意义重大。本文提出了一种适用于复杂背景和变化光照条件下的透明物体抓取视触觉融合框架，包括抓取位置检测、触觉校准以及基于视触觉融合的分类。首先，提出了一种基于高斯分布数据标注的多场景合成抓取数据集生成方法。此外，提出了一种名为TGCNN的新型抓取网络用于抓取位置检测，在合成场景和真实场景中均表现出良好效果。在触觉校准方面，受人类抓取方式启发，设计了一种基于全卷积网络的触觉特征提取方法以及一种基于中心位置的自适应抓取策略，与直接抓取相比，成功率提高了36.7%。进一步提出了一种用于透明物体分类的视触觉融合方法，将分类准确率提升了34%。所提框架协同发挥了视觉与触觉的优势，显著提升了透明物体的抓取效率。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日