In this paper, we propose a novel method for 3D scene and object reconstruction from sparse multi-view images. Different from previous methods that leverage extra information such as depth or generalizable features across scenes, our approach leverages the scene properties embedded in the multi-view inputs to create precise pseudo-labels for optimization without any prior training. Specifically, we introduce a geometry-guided approach that improves surface reconstruction accuracy from sparse views by leveraging spherical harmonics to predict the novel radiance while holistically considering all color observations for a point in the scene. Also, our pipeline exploits proxy geometry and correctly handles the occlusion in generating the pseudo-labels of radiance, which previous image-warping methods fail to avoid. Our method, dubbed Ray Augmentation (RayAug), achieves superior results on DTU and Blender datasets without requiring prior training, demonstrating its effectiveness in addressing the problem of sparse view reconstruction. Our pipeline is flexible and can be integrated into other implicit neural reconstruction methods for sparse views.
翻译:本文提出了一种新颖的方法,用于从稀疏多视角图像中进行3D场景与物体重建。与以往依赖额外信息(如深度或跨场景泛化特征)的方法不同,我们的方法利用多视角输入中隐含的场景属性,在无需任何预训练的情况下生成精确的伪标签用于优化。具体而言,我们提出了一种几何引导的方法,通过球谐函数在综合考虑场景中某一点所有颜色观测值的同时预测新辐射度,从而提高稀疏视角下的表面重建精度。此外,我们的管线利用代理几何体并正确处理遮挡问题以生成辐射度伪标签,这是以往图像扭曲方法无法避免的缺陷。我们的方法名为射线增强(RayAug),在无需预训练的情况下,在DTU和Blender数据集上取得了优异结果,证明了其在解决稀疏视角重建问题中的有效性。该管线具有灵活性,可集成到其他隐式神经重建方法中,用于稀疏视角场景。