Collect-and-Distribute Transformer for 3D Point Cloud Analysis

Although remarkable advancements have been made recently in point cloud analysis through the exploration of transformer architecture, it remains challenging to effectively learn local and global structures within point clouds. In this paper, we propose a new transformer architecture equipped with a collect-and-distribute mechanism to communicate short- and long-range contexts of point clouds, which we refer to as CDFormer. Specifically, we first utilize self-attention to capture short-range interactions within each local patch, and the updated local features are then collected into a set of proxy reference points from which we can extract long-range contexts. Afterward, we distribute the learned long-range contexts back to local points via cross-attention. To address the position clues for short- and long-range contexts, we also introduce context-aware position encoding to facilitate position-aware communications between points. We perform experiments on four popular point cloud datasets, namely ModelNet40, ScanObjectNN, S3DIS, and ShapeNetPart, for classification and segmentation. Results show the effectiveness of the proposed CDFormer, delivering several new state-of-the-art performances on point cloud classification and segmentation tasks. The code is available at \url{https://github.com/haibo-qiu/CDFormer}.

翻译：尽管近年来通过探索Transformer架构在点云分析领域取得了显著进展，但有效学习点云中的局部和全局结构仍具挑战性。本文提出一种配备收集与分发机制的新型Transformer架构（称为CDFormer），用于传递点云的短程与长程上下文。具体而言，我们首先利用自注意力机制捕获每个局部块内的短程交互，随后将更新的局部特征收集到一组代理参考点中，从中提取长程上下文；接着通过交叉注意力将学习到的长程上下文分发回局部点。针对短程与长程上下文的位置线索，我们引入上下文感知位置编码，以促进点之间的位置感知通信。在ModelNet40、ScanObjectNN、S3DIS和ShapeNetPart四个主流点云数据集上进行了分类与分割实验，结果证明了所提CDFormer的有效性，其在点云分类与分割任务中实现了多项新的最优性能。代码已在https://github.com/haibo-qiu/CDFormer开源。

相关内容

点云

关注 50

根据激光测量原理得到的点云，包括三维坐标（XYZ）和激光反射强度（Intensity）。根据摄影测量原理得到的点云，包括三维坐标（XYZ）和颜色信息（RGB）。结合激光测量和摄影测量原理得到点云，包括三维坐标（XYZ）、激光反射强度（Intensity）和颜色信息（RGB）。在获取物体表面每个采样点的空间坐标后，得到的是一个点的集合，称之为“点云”(Point Cloud)

【CVPR2020-Facebook】从检测到3D目标，FroDO: From Detections to 3D Objects

专知会员服务

33+阅读 · 2020年5月12日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日