We propose Strivec, a novel neural representation that models a 3D scene as a radiance field with sparsely distributed and compactly factorized local tensor feature grids. Our approach leverages tensor decomposition, following the recent work TensoRF, to model the tensor grids. In contrast to TensoRF which uses a global tensor and focuses on their vector-matrix decomposition, we propose to utilize a cloud of local tensors and apply the classic CANDECOMP/PARAFAC (CP) decomposition to factorize each tensor into triple vectors that express local feature distributions along spatial axes and compactly encode a local neural field. We also apply multi-scale tensor grids to discover the geometry and appearance commonalities and exploit spatial coherence with the tri-vector factorization at multiple local scales. The final radiance field properties are regressed by aggregating neural features from multiple local tensors across all scales. Our tri-vector tensors are sparsely distributed around the actual scene surface, discovered by a fast coarse reconstruction, leveraging the sparsity of a 3D scene. We demonstrate that our model can achieve better rendering quality while using significantly fewer parameters than previous methods, including TensoRF and Instant-NGP.
翻译:我们提出Strivec,一种新颖的神经表示方法,通过稀疏分布且紧凑分解的局部张量特征网格将三维场景建模为辐射场。该方法借鉴近期TensoRF工作,采用张量分解技术建模张量网格。与TensoRF使用全局张量并聚焦于其向量-矩阵分解不同,我们提出利用局部张量云,并应用经典CANDECOMP/PARAFAC(CP)分解将每个张量分解为三个向量,这些向量沿空间轴表达局部特征分布,并紧凑编码局部神经场。我们还采用多尺度张量网格以发现几何与外观共性,并通过三向量分解在不同局部尺度上利用空间相干性。最终辐射场属性通过聚合来自所有尺度多个局部张量的神经特征进行回归。我们的三向量张量稀疏分布在真实场景表面周围,通过快速粗重建发现,利用了三维场景的稀疏性。实验表明,与包括TensoRF和Instant-NGP在内的先前方法相比,我们的模型在显著减少参数数量的同时可实现更优的渲染质量。