Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction - 专知论文

会员服务 ·

0

表面重建 · 重建 · 重建方法 · 细粒度 · 网格 ·

2023 年 4 月 11 日

Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction

翻译：Voxurf: 基于体素的高效精确神经表面重建

Tong Wu,Jiaqi Wang,Xingang Pan,Xudong Xu,Christian Theobalt,Ziwei Liu,Dahua Lin

from arxiv, ICLR 2023 Spotlight. Our code is available at https://github.com/wutong16/Voxurf

Neural surface reconstruction aims to reconstruct accurate 3D surfaces based on multi-view images. Previous methods based on neural volume rendering mostly train a fully implicit model with MLPs, which typically require hours of training for a single scene. Recent efforts explore the explicit volumetric representation to accelerate the optimization via memorizing significant information with learnable voxel grids. However, existing voxel-based methods often struggle in reconstructing fine-grained geometry, even when combined with an SDF-based volume rendering scheme. We reveal that this is because 1) the voxel grids tend to break the color-geometry dependency that facilitates fine-geometry learning, and 2) the under-constrained voxel grids lack spatial coherence and are vulnerable to local minima. In this work, we present Voxurf, a voxel-based surface reconstruction approach that is both efficient and accurate. Voxurf addresses the aforementioned issues via several key designs, including 1) a two-stage training procedure that attains a coherent coarse shape and recovers fine details successively, 2) a dual color network that maintains color-geometry dependency, and 3) a hierarchical geometry feature to encourage information propagation across voxels. Extensive experiments show that Voxurf achieves high efficiency and high quality at the same time. On the DTU benchmark, Voxurf achieves higher reconstruction quality with a 20x training speedup compared to previous fully implicit methods. Our code is available at https://github.com/wutong16/Voxurf.

翻译：神经表面重建旨在基于多视图图像重建精确的三维表面。以往基于神经体渲染的方法大多使用多层感知机（MLP）训练一个完全隐式模型，这类方法通常需要数小时才能完成单个场景的训练。近期工作探索了利用可学习的体素网格存储显著信息，通过显式体素表示来加速优化过程。然而，现有的基于体素的方法即使结合带符号距离函数（SDF）的体渲染方案，在重建精细几何结构时仍面临挑战。我们揭示其原因在于：1) 体素网格倾向于破坏有助于精细几何学习的颜色-几何依赖性；2) 欠约束的体素网格缺乏空间一致性，且易陷入局部最优解。本文提出Voxurf——一种兼具高效性与精确性的基于体素的表面重建方法。Voxurf通过多项关键设计解决上述问题，包括：1) 两阶段训练流程，依次获取一致的粗粒度形状并恢复精细细节；2) 双路颜色网络以维持颜色-几何依赖性；3) 层级几何特征以促进体素间的信息传播。大量实验表明，Voxurf同时实现了高效率与高质量。在DTU基准测试中，相比以往的完全隐式方法，Voxurf在实现20倍训练加速的同时获得了更高的重建质量。我们的代码已开源在 https://github.com/wutong16/Voxurf。

0

相关内容

表面重建

【NeurIPS 2021-康奈尔大学Guandao Yang】基于神经场的几何处理，Geometry Processing with Neural Fields

【NeurIPS 2021-康奈尔大学Guandao Yang】基于神经场的几何处理，Geometry Processing with Neural Fields

专知会员服务

25+阅读 · 2022年3月27日

【AAAI 2020】InteractE: 通过增加特征交互来改进基于卷积的知识图谱嵌入， InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions

【AAAI 2020】InteractE: 通过增加特征交互来改进基于卷积的知识图谱嵌入， InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions

专知会员服务

53+阅读 · 2020年6月7日

【ACL2020-亚马逊】Transformers多分辨率和多模态语音识别，Multiresolution and Multimodal Speech Recognition with Transformers

【ACL2020-亚马逊】Transformers多分辨率和多模态语音识别，Multiresolution and Multimodal Speech Recognition with Transformers

专知会员服务

15+阅读 · 2020年5月5日

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

专知会员服务

33+阅读 · 2020年4月24日

【CVPR2020】实例感知、上下文聚焦和内存有效的弱监督目标检测，Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection

【CVPR2020】实例感知、上下文聚焦和内存有效的弱监督目标检测，Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection

专知会员服务

34+阅读 · 2020年4月11日

深度学习生物图像重建综述，Deep Learning for Biomedical Image Reconstruction: A Survey

深度学习生物图像重建综述，Deep Learning for Biomedical Image Reconstruction: A Survey

专知会员服务

40+阅读 · 2020年3月2日

【CVPR2020】用于细粒度动作识别的多模式域自适应，Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

【CVPR2020】用于细粒度动作识别的多模式域自适应，Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

专知会员服务

78+阅读 · 2020年2月25日

【中科院计算所】深几何学习综述:从表征的角度，A Survey on Deep Geometry Learning: From a Representation Perspective

【中科院计算所】深几何学习综述:从表征的角度，A Survey on Deep Geometry Learning: From a Representation Perspective

专知会员服务

51+阅读 · 2020年2月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

没有3D卷积的3D重建方法，A100上重建一帧仅需70ms

没有3D卷积的3D重建方法，A100上重建一帧仅需70ms

机器之心

0+阅读 · 2022年9月13日

三维重建 3D reconstruction 有哪些实用算法？

三维重建 3D reconstruction 有哪些实用算法？

极市平台

13+阅读 · 2020年2月23日

简评 | Video Action Recognition 的近期进展

简评 | Video Action Recognition 的近期进展

极市平台

20+阅读 · 2019年4月21日

【泡泡图灵智库】PointNet：用于三维分类与分割的点集深度学习（CVPR）

【泡泡图灵智库】PointNet：用于三维分类与分割的点集深度学习（CVPR）

泡泡机器人SLAM

11+阅读 · 2019年1月20日

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

泡泡机器人SLAM

23+阅读 · 2019年1月18日

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

泡泡机器人SLAM

25+阅读 · 2019年1月17日

【泡泡一分钟】Trifo-VIO：使用点和线的稳健且高效的双目视觉惯导里程计

【泡泡一分钟】Trifo-VIO：使用点和线的稳健且高效的双目视觉惯导里程计

泡泡机器人SLAM

13+阅读 · 2018年12月20日

【泡泡一分钟】基于多视图卷积网络的草图三维重建技术(3dv-66)

【泡泡一分钟】基于多视图卷积网络的草图三维重建技术(3dv-66)

泡泡机器人SLAM

11+阅读 · 2018年3月31日

可解释的CNN

可解释的CNN

CreateAMind

18+阅读 · 2017年10月5日

面向众核处理器的HEVC并行编码关键技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向三维服装建模的形状分析与处理方法研究

国家自然科学基金

2+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

一种含有复杂外形运动物体的高效IB-LBM算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

模拟人类视觉系统的基于图像的快速三维建模方法

国家自然科学基金

0+阅读 · 2011年12月31日

面向复杂形体几何建模的特征曲线网重用

国家自然科学基金

0+阅读 · 2011年12月31日

基于形状分析的植物建模与快速绘制

国家自然科学基金

0+阅读 · 2009年12月31日

基于2D视频视觉关注度的3D重建方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于Sparse-Land模型的SAR图像噪声抑制与分割

国家自然科学基金

0+阅读 · 2009年12月31日

基于合成基准测试程序的多核处理器模拟技术研究

国家自然科学基金

0+阅读 · 2009年12月31日

Towards a Robust Framework for NeRF Evaluation

Arxiv

0+阅读 · 2023年5月29日

Volume Feature Rendering for Fast Neural Radiance Field Reconstruction

Arxiv

0+阅读 · 2023年5月29日

FastMESH: Fast Surface Reconstruction by Hexagonal Mesh-based Neural Rendering

Arxiv

0+阅读 · 2023年5月29日

NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images

Arxiv

0+阅读 · 2023年5月27日

PlaNeRF: SVD Unsupervised 3D Plane Regularization for NeRF Large-Scale Scene Reconstruction

Arxiv

0+阅读 · 2023年5月26日

BEV-IO: Enhancing Bird's-Eye-View 3D Detection with Instance Occupancy

Arxiv

0+阅读 · 2023年5月26日

Deep Radar Inverse Sensor Models for Dynamic Occupancy Grid Maps

Arxiv

0+阅读 · 2023年5月25日

ACAI: Extending Arm Confidential Computing Architecture Protection from CPUs to Accelerators

Arxiv

0+阅读 · 2023年5月25日

HypLiLoc: Towards Effective LiDAR Pose Regression with Hyperbolic Fusion

Arxiv

0+阅读 · 2023年5月25日

Deep Generative Model for Simultaneous Range Error Mitigation and Environment Identification

Arxiv

0+阅读 · 2023年5月23日

VIP会员

文章信息

相关主题

最新内容

装甲突击旅：现代战争思考、战斗与组织

装甲突击旅：现代战争思考、战斗与组织

专知会员服务

0+阅读 · 13分钟前

在人工智能加速决策环境中拓展OODA循环

在人工智能加速决策环境中拓展OODA循环

专知会员服务

0+阅读 · 23分钟前

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

专知会员服务

0+阅读 · 34分钟前

军事欺骗：供作战战术指挥官使用的工具

军事欺骗：供作战战术指挥官使用的工具

专知会员服务

0+阅读 · 38分钟前

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

专知会员服务

3+阅读 · 6月23日

综述 | 世界动作模型：少做梦，多行动

综述 | 世界动作模型：少做梦，多行动

专知会员服务

4+阅读 · 6月23日

美以伊冲突：无人机与人工智能的运用

美以伊冲突：无人机与人工智能的运用

专知会员服务

7+阅读 · 6月23日

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

专知会员服务

3+阅读 · 6月23日

《特种部队在透明战场中的生存力》最新报告

《特种部队在透明战场中的生存力》最新报告

专知会员服务

4+阅读 · 6月23日

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

专知会员服务

7+阅读 · 6月23日

《人工智能生成的零日漏洞：对未来作战的影响》

《人工智能生成的零日漏洞：对未来作战的影响》

专知会员服务

5+阅读 · 6月23日

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

专知会员服务

3+阅读 · 6月23日

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

6+阅读 · 6月22日

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

8+阅读 · 6月22日

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

8+阅读 · 6月22日

相关VIP内容

【NeurIPS 2021-康奈尔大学Guandao Yang】基于神经场的几何处理，Geometry Processing with Neural Fields

【NeurIPS 2021-康奈尔大学Guandao Yang】基于神经场的几何处理，Geometry Processing with Neural Fields

专知会员服务

25+阅读 · 2022年3月27日

【AAAI 2020】InteractE: 通过增加特征交互来改进基于卷积的知识图谱嵌入， InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions

【AAAI 2020】InteractE: 通过增加特征交互来改进基于卷积的知识图谱嵌入， InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions

专知会员服务

53+阅读 · 2020年6月7日

【ACL2020-亚马逊】Transformers多分辨率和多模态语音识别，Multiresolution and Multimodal Speech Recognition with Transformers

【ACL2020-亚马逊】Transformers多分辨率和多模态语音识别，Multiresolution and Multimodal Speech Recognition with Transformers

专知会员服务

15+阅读 · 2020年5月5日

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

【IJCAI2020】神经摘要结构性注意力，Neural Abstractive Summarization with Structural Attention

专知会员服务

33+阅读 · 2020年4月24日

【CVPR2020】实例感知、上下文聚焦和内存有效的弱监督目标检测，Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection

【CVPR2020】实例感知、上下文聚焦和内存有效的弱监督目标检测，Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection

专知会员服务

34+阅读 · 2020年4月11日

深度学习生物图像重建综述，Deep Learning for Biomedical Image Reconstruction: A Survey

深度学习生物图像重建综述，Deep Learning for Biomedical Image Reconstruction: A Survey

专知会员服务

40+阅读 · 2020年3月2日

【CVPR2020】用于细粒度动作识别的多模式域自适应，Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

【CVPR2020】用于细粒度动作识别的多模式域自适应，Multi-Modal Domain Adaptation for Fine-Grained Action Recognition

专知会员服务

78+阅读 · 2020年2月25日

【中科院计算所】深几何学习综述:从表征的角度，A Survey on Deep Geometry Learning: From a Representation Perspective

【中科院计算所】深几何学习综述:从表征的角度，A Survey on Deep Geometry Learning: From a Representation Perspective

专知会员服务

51+阅读 · 2020年2月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

在人工智能加速决策环境中拓展OODA循环

军事欺骗：供作战战术指挥官使用的工具

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

没有3D卷积的3D重建方法，A100上重建一帧仅需70ms

没有3D卷积的3D重建方法，A100上重建一帧仅需70ms

机器之心

0+阅读 · 2022年9月13日

三维重建 3D reconstruction 有哪些实用算法？

三维重建 3D reconstruction 有哪些实用算法？

极市平台

13+阅读 · 2020年2月23日

简评 | Video Action Recognition 的近期进展

简评 | Video Action Recognition 的近期进展

极市平台

20+阅读 · 2019年4月21日

【泡泡图灵智库】PointNet：用于三维分类与分割的点集深度学习（CVPR）

【泡泡图灵智库】PointNet：用于三维分类与分割的点集深度学习（CVPR）

泡泡机器人SLAM

11+阅读 · 2019年1月20日

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

【泡泡一分钟】DS-SLAM: 动态环境下的语义视觉SLAM

泡泡机器人SLAM

23+阅读 · 2019年1月18日

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

【泡泡一分钟】基于运动估计的激光雷达和相机标定方法

泡泡机器人SLAM

25+阅读 · 2019年1月17日

【泡泡一分钟】Trifo-VIO：使用点和线的稳健且高效的双目视觉惯导里程计

【泡泡一分钟】Trifo-VIO：使用点和线的稳健且高效的双目视觉惯导里程计

泡泡机器人SLAM

13+阅读 · 2018年12月20日

【泡泡一分钟】基于多视图卷积网络的草图三维重建技术(3dv-66)

【泡泡一分钟】基于多视图卷积网络的草图三维重建技术(3dv-66)

泡泡机器人SLAM

11+阅读 · 2018年3月31日

可解释的CNN

可解释的CNN

CreateAMind

18+阅读 · 2017年10月5日

相关论文

Towards a Robust Framework for NeRF Evaluation

Arxiv

0+阅读 · 2023年5月29日

Volume Feature Rendering for Fast Neural Radiance Field Reconstruction

Arxiv

0+阅读 · 2023年5月29日

FastMESH: Fast Surface Reconstruction by Hexagonal Mesh-based Neural Rendering

Arxiv

0+阅读 · 2023年5月29日

NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images

Arxiv

0+阅读 · 2023年5月27日

PlaNeRF: SVD Unsupervised 3D Plane Regularization for NeRF Large-Scale Scene Reconstruction

Arxiv

0+阅读 · 2023年5月26日

BEV-IO: Enhancing Bird's-Eye-View 3D Detection with Instance Occupancy

Arxiv

0+阅读 · 2023年5月26日

Deep Radar Inverse Sensor Models for Dynamic Occupancy Grid Maps

Arxiv

0+阅读 · 2023年5月25日

ACAI: Extending Arm Confidential Computing Architecture Protection from CPUs to Accelerators

Arxiv

0+阅读 · 2023年5月25日

HypLiLoc: Towards Effective LiDAR Pose Regression with Hyperbolic Fusion

Arxiv

0+阅读 · 2023年5月25日

Deep Generative Model for Simultaneous Range Error Mitigation and Environment Identification

Arxiv

0+阅读 · 2023年5月23日

相关基金

面向众核处理器的HEVC并行编码关键技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向三维服装建模的形状分析与处理方法研究

国家自然科学基金

2+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

一种含有复杂外形运动物体的高效IB-LBM算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

模拟人类视觉系统的基于图像的快速三维建模方法

国家自然科学基金

0+阅读 · 2011年12月31日

面向复杂形体几何建模的特征曲线网重用

国家自然科学基金

0+阅读 · 2011年12月31日

基于形状分析的植物建模与快速绘制

国家自然科学基金

0+阅读 · 2009年12月31日

基于2D视频视觉关注度的3D重建方法研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于Sparse-Land模型的SAR图像噪声抑制与分割

国家自然科学基金

0+阅读 · 2009年12月31日

基于合成基准测试程序的多核处理器模拟技术研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员