3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering

Noise is an inevitable aspect of point cloud acquisition, necessitating filtering as a fundamental task within the realm of 3D vision. Existing learning-based filtering methods have shown promising capabilities on small-scale synthetic or real-world datasets. Nonetheless, the effectiveness of these methods is constrained when dealing with a substantial quantity of point clouds. This limitation primarily stems from their limited denoising capabilities for large-scale point clouds and their inclination to generate noisy outliers after denoising. The recent introduction of State Space Models (SSMs) for long sequence modeling in Natural Language Processing (NLP) presents a promising solution for handling large-scale data. Encouraged by iterative point cloud filtering methods, we introduce 3DMambaIPF, firstly incorporating Mamba (Selective SSM) architecture to sequentially handle extensive point clouds from large scenes, capitalizing on its strengths in selective input processing and long sequence modeling capabilities. Additionally, we integrate a robust and fast differentiable rendering loss to constrain the noisy points around the surface. In contrast to previous methodologies, this differentiable rendering loss enhances the visual realism of denoised geometric structures and aligns point cloud boundaries more closely with those observed in real-world objects. Extensive evaluation on datasets comprising small-scale synthetic and real-world models (typically with up to 50K points) demonstrate that our method achieves state-of-the-art results. Moreover, we showcase the superior scalability and efficiency of our method on large-scale models with about 500K points, where the majority of the existing learning-based denoising methods are unable to handle.

翻译：噪声是点云采集过程中不可避免的问题，因此滤波成为三维视觉领域的基础任务。现有基于学习的滤波方法在合成或真实小规模数据集上已展现出良好效果，但在处理大规模点云时效能受限。这一局限主要源于其在大规模点云去噪能力上的不足，以及去噪后易产生噪声离群点的倾向。自然语言处理（NLP）中近期引入的状态空间模型（SSMs）为长序列建模提供了新思路，为处理大规模数据带来了可行方案。受迭代点云滤波方法启发，我们提出3DMambaIPF，首次将Mamba（选择性SSM）架构融入其中，利用其在选择性输入处理和长序列建模方面的优势，顺序处理大型场景中的海量点云。此外，我们整合了稳健且快速的可微渲染损失函数，以约束曲面附近的噪声点。与先前方法不同，该可微渲染损失增强了去噪几何结构的视觉真实感，并使点云边界更贴近真实世界物体的观测边界。在包含小规模合成及真实模型（通常含多达5万点）的数据集上的全面评估表明，我们的方法达到了最先进水平。此外，我们展示了该方法在约50万点的大规模模型上的卓越可扩展性和效率，而现有大多数基于学习的去噪方法无法处理此类模型。

相关内容

点云

关注 50

根据激光测量原理得到的点云，包括三维坐标（XYZ）和激光反射强度（Intensity）。根据摄影测量原理得到的点云，包括三维坐标（XYZ）和颜色信息（RGB）。结合激光测量和摄影测量原理得到点云，包括三维坐标（XYZ）、激光反射强度（Intensity）和颜色信息（RGB）。在获取物体表面每个采样点的空间坐标后，得到的是一个点的集合，称之为“点云”(Point Cloud)

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日