The recent advance of neural fields, such as neural radiance fields, has significantly pushed the boundary of scene representation learning. Aiming to boost the computation efficiency and rendering quality of 3D scenes, a popular line of research maps the 3D coordinate system to another measuring system, e.g., 2D manifolds and hash tables, for modeling neural fields. The conversion of coordinate systems can be typically dubbed as \emph{gauge transformation}, which is usually a pre-defined mapping function, e.g., orthogonal projection or spatial hash function. This begs a question: can we directly learn a desired gauge transformation along with the neural field in an end-to-end manner? In this work, we extend this problem to a general paradigm with a taxonomy of discrete \& continuous cases, and develop a learning framework to jointly optimize gauge transformations and neural fields. To counter the problem that the learning of gauge transformations can collapse easily, we derive a general regularization mechanism from the principle of information conservation during the gauge transformation. To circumvent the high computation cost in gauge learning with regularization, we directly derive an information-invariant gauge transformation which allows to preserve scene information inherently and yield superior performance. Project: https://fnzhan.com/Neural-Gauge-Fields
翻译:神经场(如神经辐射场)的最新进展显著推动了场景表示学习的发展。为提升三维场景的计算效率与渲染质量,主流研究方向通过将三维坐标系映射至其他测量系统(如二维流形及哈希表)来构建神经场模型。这种坐标系转换通常被称为规范变换,其多为预定义的映射函数(例如正交投影或空间哈希函数)。由此引发一个关键问题:能否以端到端方式与神经场协同学习所需的规范变换?本文将问题扩展至涵盖离散与连续情形的通用范式,并开发了联合优化规范变换与神经场的统一学习框架。针对规范变换学习易坍塌的难题,我们基于规范变换中的信息守恒原理推导出通用正则化机制。为避免含正则化的规范学习产生高昂计算成本,我们直接推导出具有信息不变性的规范变换方法,该方法可从根本上保持场景信息并实现卓越性能。项目主页:https://fnzhan.com/Neural-Gauge-Fields