PSAvatar: A Point-based Morphable Shape Model for Real-Time Head Avatar Creation with 3D Gaussian Splatting

Despite much progress, real-time, high-fidelity head avatar animation remains difficult, and existing methods have to trade off speed against quality. 3DMM-based methods often fail to model non-facial structures such as eyeglasses and hairstyles, while neural implicit models suffer from inflexible deformation and inefficient rendering. Although 3D Gaussians have shown promise for geometry representation and radiance field reconstruction, applying them to head avatar creation remains a major challenge, since it is difficult for 3D Gaussians to model the head shape variations caused by changing poses and expressions. In this paper, we introduce PSAvatar, a novel framework for animatable head avatar creation that uses discrete geometric primitives to create a parametric morphable shape model and employs 3D Gaussians for fine detail representation and high-fidelity rendering. The parametric morphable shape model is a Point-based Morphable Shape Model (PMSM), which uses points instead of meshes for 3D representation to achieve greater representation flexibility. The PMSM first converts the FLAME mesh to points by sampling both on the mesh surface and off the mesh, which enables the reconstruction of not only surface-like structures but also complex geometries such as eyeglasses and hairstyles. By aligning these points with the head shape in an analysis-by-synthesis manner, the PMSM makes it possible to use 3D Gaussians for fine detail representation and appearance modeling, thus enabling the creation of high-fidelity avatars. We show that PSAvatar can reconstruct high-fidelity head avatars for a variety of subjects, and that the avatars can be animated in real time ($\ge$ 25 fps at a resolution of 512 $\times$ 512).
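
To make the mesh-to-points step concrete, the following is a minimal sketch of sampling points both on and off a triangle mesh. It is not the PSAvatar implementation: the function name `sample_on_and_off_mesh`, the `offsets` parameter, and the choice to place off-mesh points by displacing surface samples along face normals are all assumptions made for illustration, since the abstract does not specify how off-mesh samples are generated.

```python
import numpy as np


def sample_on_and_off_mesh(vertices, faces, n_samples, offsets=(0.0,), seed=0):
    """Sample points on a triangle mesh and on offset layers around it.

    Hypothetical helper, not taken from the paper. Off-mesh points are
    produced by shifting each surface sample along its face normal by each
    value in `offsets` (an assumption; 0.0 keeps the point on the surface).
    """
    rng = np.random.default_rng(seed)
    tri = vertices[faces]                      # (F, 3, 3) triangle corners
    e1 = tri[:, 1] - tri[:, 0]                 # first edge of each triangle
    e2 = tri[:, 2] - tri[:, 0]                 # second edge of each triangle
    cross = np.cross(e1, e2)
    norm = np.linalg.norm(cross, axis=1)
    areas = 0.5 * norm
    # Guard against degenerate (zero-area) triangles when normalizing.
    normals = cross / np.maximum(norm, 1e-12)[:, None]

    # Pick faces with probability proportional to area so that the
    # surface samples are uniformly distributed over the mesh.
    face_idx = rng.choice(len(faces), size=n_samples, p=areas / areas.sum())

    # Uniform barycentric sampling: reflect samples that fall outside
    # the triangle back into it.
    u = rng.random(n_samples)
    v = rng.random(n_samples)
    outside = u + v > 1.0
    u[outside] = 1.0 - u[outside]
    v[outside] = 1.0 - v[outside]
    surface = tri[face_idx, 0] + u[:, None] * e1[face_idx] + v[:, None] * e2[face_idx]

    # One layer of points per offset, displaced along the face normal.
    layers = [surface + d * normals[face_idx] for d in offsets]
    points = np.concatenate(layers, axis=0)
    # Remember which face each point came from.
    point_faces = np.tile(face_idx, len(offsets))
    return points, point_faces


# Toy usage on a single triangle (a real FLAME mesh would be loaded instead).
verts = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
tris = np.array([[0, 1, 2]])
pts, pt_faces = sample_on_and_off_mesh(verts, tris, n_samples=100, offsets=(0.0, 0.1))
```

Returning the source face index for each point is a design choice of this sketch: it is one plausible way to let a point follow its triangle when the underlying mesh is re-posed, but whether the PMSM binds points to faces in this manner is not stated in the abstract.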
