Despite much progress, creating real-time high-fidelity head avatar is still difficult and existing methods have to trade-off between speed and quality. 3DMM based methods often fail to model non-facial structures such as eyeglasses and hairstyles, while neural implicit models suffer from deformation inflexibility and rendering inefficiency. Although 3D Gaussian has been demonstrated to possess promising capability for geometry representation and radiance field reconstruction, applying 3D Gaussian in head avatar creation remains a major challenge since it is difficult for 3D Gaussian to model the head shape variations caused by changing poses and expressions. In this paper, we introduce PSAvatar, a novel framework for animatable head avatar creation that utilizes discrete geometric primitive to create a parametric morphable shape model and employs 3D Gaussian for fine detail representation and high fidelity rendering. The parametric morphable shape model is a Point-based Morphable Shape Model (PMSM) which uses points instead of meshes for 3D representation to achieve enhanced representation flexibility. The PMSM first converts the FLAME mesh to points by sampling on the surfaces as well as off the meshes to enable the reconstruction of not only surface-like structures but also complex geometries such as eyeglasses and hairstyles. By aligning these points with the head shape in an analysis-by-synthesis manner, the PMSM makes it possible to utilize 3D Gaussian for fine detail representation and appearance modeling, thus enabling the creation of high-fidelity avatars. We show that PSAvatar can reconstruct high-fidelity head avatars of a variety of subjects and the avatars can be animated in real-time ($\ge$ 25 fps at a resolution of 512 x 512 )
翻译:尽管取得了诸多进展,创建实时高保真头部虚拟形象仍然困难,现有方法不得不在速度与质量之间权衡。基于3DMM的方法通常难以建模眼镜、发型等非面部结构,而神经隐式模型则受限于变形灵活性不足与渲染效率低下。尽管3D高斯已被证明在几何表示与辐射场重建方面具有潜力,但其在头部虚拟形象创建中的应用仍面临重大挑战——3D高斯难以建模由姿态和表情变化引起的头部形状变化。本文提出PSAvatar,一种用于可动画头部虚拟形象创建的新型框架,该框架利用离散几何基元构建参数化可变形形状模型,并采用3D高斯实现精细细节表示与高保真渲染。该参数化可变形形状模型是基于点的可变形形状模型(PMSM),使用点而非网格进行3D表示,从而增强表示灵活性。PMSM首先通过表面采样与网格外采样将FLAME网格转换为点云,使其不仅能重建表面类结构,还能重建眼镜和发型等复杂几何体。通过以分析合成方式将这些点与头部形状对齐,PMSM使得利用3D高斯进行精细细节表示与外观建模成为可能,从而创建高保真虚拟形象。实验表明,PSAvatar可重建多种对象的头部虚拟形象,且这些虚拟形象能在实时(分辨率512×512时≥25帧/秒)条件下驱动。