Point Cloud Classification Using Content-based Transformer via Clustering in Feature Space

Recently, there have been some attempts of Transformer in 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention, but ignore their content and fail to establish relationships between distant but relevant points. To overcome the limitation of local spatial attention, we propose a point content-based Transformer architecture, called PointConT for short. It exploits the locality of points in the feature space (content-based), which clusters the sampled points with similar features into the same class and computes the self-attention within each class, thus enabling an effective trade-off between capturing long-range dependencies and computational complexity. We further introduce an Inception feature aggregator for point cloud classification, which uses parallel structures to aggregate high-frequency and low-frequency information in each branch separately. Extensive experiments show that our PointConT model achieves a remarkable performance on point cloud shape classification. Especially, our method exhibits 90.3% Top-1 accuracy on the hardest setting of ScanObjectNN. Source code of this paper is available at https://github.com/yahuiliu99/PointConT.

翻译：近年来，Transformer架构在三维点云分类任务中已有初步探索。为降低计算复杂度，现有方法多聚焦于局部空间注意力机制，但忽视了点云的内容特征，无法建立远距离相关点之间的联系。为突破局部空间注意力的局限性，本文提出了一种基于点内容感知的Transformer架构——PointConT。该架构利用特征空间中点的局部性（基于内容），将具有相似特征的采样点聚类至同一类别，并在各类别内计算自注意力，从而有效平衡长距离依赖关系捕获与计算复杂度。我们进一步引入面向点云分类的Inception特征聚合器，通过并行结构在各分支分别聚合高频与低频信息。大量实验表明，我们的PointConT模型在点云形状分类任务中取得了卓越性能。特别地，在ScanObjectNN最难配置下，该方法实现了90.3%的Top-1准确率。本文源代码已发布于https://github.com/yahuiliu99/PointConT。

相关内容

点云

关注 50

根据激光测量原理得到的点云，包括三维坐标（XYZ）和激光反射强度（Intensity）。根据摄影测量原理得到的点云，包括三维坐标（XYZ）和颜色信息（RGB）。结合激光测量和摄影测量原理得到点云，包括三维坐标（XYZ）、激光反射强度（Intensity）和颜色信息（RGB）。在获取物体表面每个采样点的空间坐标后，得到的是一个点的集合，称之为“点云”(Point Cloud)

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【SIGIR2020】一个统一的双视图模型，用于具有不一致性损失的评论总结和情绪分类，A Unified Dual-view Model for Review Summarization and Sentiment Classification with Inconsistency Loss

专知会员服务

22+阅读 · 2020年6月3日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日