Diffusion-Occ：基于占据概率扩散的三维点云补全方法 (Diffusion-Occ: 3D Point Cloud Completion via Occupancy Diffusion)

Point clouds are crucial for capturing three-dimensional data but often suffer from incompleteness due to limitations such as resolution and occlusion. Traditional methods typically rely on point-based approaches within discriminative frameworks for point cloud completion. In this paper, we introduce \textbf{Diffusion-Occ}, a novel framework for Diffusion Point Cloud Completion. Diffusion-Occ utilizes a two-stage coarse-to-fine approach. In the first stage, the Coarse Density Voxel Prediction Network (CDNet) processes partial points to predict coarse density voxels, streamlining global feature extraction through voxel classification, as opposed to previous regression-based methods. In the second stage, we introduce the Occupancy Generation Network (OccGen), a conditional occupancy diffusion model based on a transformer architecture and enhanced by our Point-Voxel Fuse (PVF) block. This block integrates coarse density voxels with partial points to leverage both global and local features for comprehensive completion. By thresholding the occupancy field, we convert it into a complete point cloud. Additionally, our method employs diverse training mixtures and efficient diffusion parameterization to enable effective one-step sampling during both training and inference. Experimental results demonstrate that Diffusion-Occ outperforms existing discriminative and generative methods.

翻译：点云是捕获三维数据的关键，但由于分辨率和遮挡等限制，常常存在不完整的问题。传统方法通常在判别式框架内依赖基于点的方法进行点云补全。本文提出 \textbf{Diffusion-Occ}，一种用于扩散点云补全的新框架。Diffusion-Occ 采用由粗到精的两阶段方法。在第一阶段，粗粒度密度体素预测网络（CDNet）处理部分点以预测粗粒度密度体素，通过体素分类简化全局特征提取，这与先前基于回归的方法不同。在第二阶段，我们引入了占据概率生成网络（OccGen），这是一个基于 Transformer 架构的条件占据概率扩散模型，并通过我们提出的点-体素融合（PVF）模块进行增强。该模块将粗粒度密度体素与部分点云相结合，以利用全局和局部特征进行全面的补全。通过对占据概率场进行阈值化，我们将其转换为完整的点云。此外，我们的方法采用多样化的训练混合策略和高效的扩散参数化，从而在训练和推理过程中实现有效的一步采样。实验结果表明，Diffusion-Occ 的性能优于现有的判别式和生成式方法。

相关内容

点云

关注 50

根据激光测量原理得到的点云，包括三维坐标（XYZ）和激光反射强度（Intensity）。根据摄影测量原理得到的点云，包括三维坐标（XYZ）和颜色信息（RGB）。结合激光测量和摄影测量原理得到点云，包括三维坐标（XYZ）、激光反射强度（Intensity）和颜色信息（RGB）。在获取物体表面每个采样点的空间坐标后，得到的是一个点的集合，称之为“点云”(Point Cloud)

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日