We present a method for fast 3D reconstruction and real-time rendering of dynamic humans from monocular videos with accompanying parametric body fits. Our method can reconstruct a dynamic human in less than 3h using a single GPU, compared to recent state-of-the-art alternatives that take up to 72h. These speedups are obtained by using a lightweight deformation model solely based on linear blend skinning, and an efficient factorized volumetric representation for modeling the shape and color of the person in canonical pose. Moreover, we propose a novel local ray marching rendering which, by exploiting standard GPU hardware and without any baking or conversion of the radiance field, allows visualizing the neural human on a mobile VR device at 40 frames per second with minimal loss of visual quality. Our experimental evaluation shows superior or competitive results with state-of-the art methods while obtaining large training speedup, using a simple model, and achieving real-time rendering.
翻译:我们提出一种从单目视频及配套参数化人体拟合中实现快速三维重建与动态人体实时渲染的方法。本方法在单GPU上可在3小时内完成动态人体重建,而现有最新替代方案需耗时长达72小时。这种加速得益于基于线性混合蒙皮的轻量形变模型,以及用于建模规范姿态下人体形状与颜色的高效因子化体积表示。此外,我们提出一种新颖的局部光线步进渲染技术,该技术利用标准GPU硬件,无需对辐射场进行任何烘焙或转换,即可在移动VR设备上以每秒40帧的帧率可视化神经人体,且视觉质量损失极小。实验结果表明,本方法在获得大幅训练加速、采用简单模型并实现实时渲染的同时,取得了与现有最优方法相当或更优的结果。