Recovery of an underlying scene geometry from multiview images stands as a long-time challenge in computer vision research. The recent promise leverages neural implicit surface learning and differentiable volume rendering, and achieves both the recovery of scene geometry and synthesis of novel views, where deep priors of neural models are used as an inductive smoothness bias. While promising for object-level surfaces, these methods suffer when coping with complex scene surfaces. In the meanwhile, traditional multi-view stereo can recover the geometry of scenes with rich textures, by globally optimizing the local, pixel-wise correspondences across multiple views. We are thus motivated to make use of the complementary benefits from the two strategies, and propose a method termed Helix-shaped neural implicit Surface learning or HelixSurf; HelixSurf uses the intermediate prediction from one strategy as the guidance to regularize the learning of the other one, and conducts such intertwined regularization iteratively during the learning process. We also propose an efficient scheme for differentiable volume rendering in HelixSurf. Experiments on surface reconstruction of indoor scenes show that our method compares favorably with existing methods and is orders of magnitude faster, even when some of existing methods are assisted with auxiliary training data. The source code is available at https://github.com/Gorilla-Lab-SCUT/HelixSurf.
翻译:从多视角图像恢复场景几何结构是计算机视觉研究中的长期挑战。近期进展通过结合神经隐式曲面学习与可微体积渲染,利用神经模型的深度先验作为归纳平滑偏置,同时实现了场景几何恢复与新视角合成。尽管此类方法在物体级表面表现出色,但在处理复杂场景表面时仍存在局限。与此同时,传统多视角立体可通过全局优化多视图间的局部像素级对应关系,恢复具有丰富纹理的场景几何。受此启发,我们利用两种策略的互补优势,提出名为螺旋形神经隐式曲面学习(HelixSurf)的方法。HelixSurf将某一策略的中间预测结果作为另一策略学习的正则化引导,并在学习过程中迭代执行这种交织正则化。此外,我们还为HelixSurf设计了高效的差异化体积渲染方案。室内场景曲面重建实验表明,即使现有方法借助辅助训练数据,本方法仍能以数倍于前者的速度取得更优性能。源代码已开源至https://github.com/Gorilla-Lab-SCUT/HelixSurf。