Neural surface reconstruction has been shown to be powerful for recovering dense 3D surfaces via image-based neural rendering. However, current methods struggle to recover detailed structures of real-world scenes. To address the issue, we present Neuralangelo, which combines the representation power of multi-resolution 3D hash grids with neural surface rendering. Two key ingredients enable our approach: (1) numerical gradients for computing higher-order derivatives as a smoothing operation and (2) coarse-to-fine optimization on the hash grids controlling different levels of details. Even without auxiliary inputs such as depth, Neuralangelo can effectively recover dense 3D surface structures from multi-view images with fidelity significantly surpassing previous methods, enabling detailed large-scale scene reconstruction from RGB video captures.
翻译:神经表面重建已被证明是通过基于图像的神经渲染恢复稠密三维表面的强大方法。然而,现有方法在恢复真实场景的精细结构方面仍存在困难。为解决这一问题,我们提出Neuralangelo,该方法将多分辨率三维哈希网格的表征能力与神经表面渲染相结合。其两大关键要素为:(1)利用数值梯度计算高阶导数作为平滑操作;(2)在控制不同细节层次的哈希网格上执行从粗到细的优化。即便没有深度等辅助输入,Neuralangelo也能从多视角图像中有效恢复稠密的三维表面结构,其保真度显著超越先前方法,从而能够从RGB视频捕获中实现精细的大规模场景重建。