We present a novel framework, called FrameNeRF, designed to apply off-the-shelf fast high-fidelity NeRF models with fast training speed and high rendering quality for few-shot novel view synthesis tasks. The training stability of fast high-fidelity models is typically constrained to dense views, making them unsuitable for few-shot novel view synthesis tasks. To address this limitation, we utilize a regularization model as a data generator to produce dense views from sparse inputs, facilitating subsequent training of fast high-fidelity models. Since these dense views are pseudo ground truth generated by the regularization model, original sparse images are then used to fine-tune the fast high-fidelity model. This process helps the model learn realistic details and correct artifacts introduced in earlier stages. By leveraging an off-the-shelf regularization model and a fast high-fidelity model, our approach achieves state-of-the-art performance across various benchmark datasets.
翻译:我们提出了一种名为FrameNeRF的新型框架,旨在将现成的快速高保真NeRF模型应用于少样本新视角合成任务,兼具快速训练速度与高质量渲染性能。快速高保真模型的训练稳定性通常局限于密集视角场景,使其难以适用于少样本新视角合成任务。为解决这一限制,我们利用一个正则化模型作为数据生成器,从稀疏输入中生成密集视角,从而支撑后续快速高保真模型的训练。由于这些密集视角是由正则化模型生成的伪真实数据,我们进一步使用原始稀疏图像对快速高保真模型进行微调。该过程有助于模型学习真实细节并修正前期阶段引入的伪影。通过结合现成的正则化模型与快速高保真模型,我们的方法在多个基准数据集上达到了最先进的性能。