Real-Time Radiance Fields for Single-Image Portrait View Synthesis

We present a one-shot method to infer and render a photorealistic 3D representation from a single unposed image (e.g., face portrait) in real-time. Given a single RGB input, our image encoder directly predicts a canonical triplane representation of a neural radiance field for 3D-aware novel view synthesis via volume rendering. Our method is fast (24 fps) on consumer hardware, and produces higher quality results than strong GAN-inversion baselines that require test-time optimization. To train our triplane encoder pipeline, we use only synthetic data, showing how to distill the knowledge from a pretrained 3D GAN into a feedforward encoder. Technical contributions include a Vision Transformer-based triplane encoder, a camera data augmentation strategy, and a well-designed loss function for synthetic data training. We benchmark against the state-of-the-art methods, demonstrating significant improvements in robustness and image quality in challenging real-world settings. We showcase our results on portraits of faces (FFHQ) and cats (AFHQ), but our algorithm can also be applied in the future to other categories with a 3D-aware image generator.

翻译：我们提出了一种单次推断方法，可从单张非摆拍图像（如人脸肖像）实时推断并渲染出逼真的三维表示。给定单张RGB输入，我们的图像编码器直接预测神经辐射场的规范三平面表示，通过体渲染实现三维感知的新视角合成。该方法在消费级硬件上达到24帧/秒的实时速度，且生成质量优于需要测试时优化的强GAN反演基线方法。为训练三平面编码器流水线，我们仅使用合成数据，展示了如何将预训练三维GAN的知识蒸馏至前馈编码器。技术贡献包括基于Vision Transformer的三平面编码器、相机数据增强策略及适用于合成数据训练的精心设计的损失函数。我们与最先进方法进行了基准测试，证明了在具有挑战性的真实场景中鲁棒性和图像质量的显著提升。我们在人脸（FFHQ）和猫脸（AFHQ）肖像上展示了结果，但我们的算法未来也可应用于其他具有三维感知图像生成器的类别。

相关内容

损失函数（机器学习）

关注 10

损失函数，在AI中亦称呼距离函数，度量函数。此处的距离代表的是抽象性的，代表真实数据与预测数据之间的误差。损失函数（loss function）是用来估量你模型的预测值f(x)与真实值Y的不一致程度，它是一个非负实值函数,通常使用L(Y, f(x))来表示，损失函数越小，模型的鲁棒性就越好。损失函数是经验风险函数的核心部分，也是结构风险函数重要组成部分。

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

116+阅读 · 2020年4月5日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日