Distilling Diffusion Models to Efficient 3D LiDAR Scene Completion

Diffusion models have been applied to 3D LiDAR scene completion due to their strong training stability and high completion quality. However, the slow sampling speed limits the practical application of diffusion-based scene completion models since autonomous vehicles require an efficient perception of surrounding environments. This paper proposes a novel distillation method tailored for 3D LiDAR scene completion models, dubbed $\textbf{ScoreLiDAR}$, which achieves efficient yet high-quality scene completion. ScoreLiDAR enables the distilled model to sample in significantly fewer steps after distillation. To improve completion quality, we also introduce a novel $\textbf{Structural Loss}$, which encourages the distilled model to capture the geometric structure of the 3D LiDAR scene. The loss contains a scene-wise term constraining the holistic structure and a point-wise term constraining the key landmark points and their relative configuration. Extensive experiments demonstrate that ScoreLiDAR significantly accelerates the completion time from 30.55 to 5.37 seconds per frame ($>$5$\times$) on SemanticKITTI and achieves superior performance compared to state-of-the-art 3D LiDAR scene completion models. Our code is publicly available at https://github.com/happyw1nd/ScoreLiDAR.

翻译：扩散模型因其强大的训练稳定性和高补全质量而被应用于三维激光雷达场景补全。然而，由于自动驾驶车辆需要对周围环境进行高效感知，缓慢的采样速度限制了基于扩散的场景补全模型的实际应用。本文提出了一种专为三维激光雷达场景补全模型设计的新型蒸馏方法，称为 $\textbf{ScoreLiDAR}$，该方法实现了高效且高质量的場景补全。ScoreLiDAR 使蒸馏后的模型能够在显著更少的步骤中进行采样。为了提高补全质量，我们还引入了一种新颖的 $\textbf{结构损失}$，该损失鼓励蒸馏模型捕捉三维激光雷达场景的几何结构。该损失包含一个约束整体结构的场景级项和一个约束关键地标点及其相对配置的点级项。大量实验表明，ScoreLiDAR 在 SemanticKITTI 数据集上将每帧补全时间从 30.55 秒显著加速至 5.37 秒（$>$5$\times$），并且与最先进的三维激光雷达场景补全模型相比，实现了更优的性能。我们的代码公开在 https://github.com/happyw1nd/ScoreLiDAR。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Query2box: 使用盒嵌入对向量空间中的知识图谱进行推理，Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings

专知会员服务

46+阅读 · 2020年5月11日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日