Despite much progress, achieving real-time high-fidelity head avatar animation is still difficult and existing methods have to trade-off between speed and quality. 3DMM based methods often fail to model non-facial structures such as eyeglasses and hairstyles, while neural implicit models suffer from deformation inflexibility and rendering inefficiency. Although 3D Gaussian has been demonstrated to possess promising capability for geometry representation and radiance field reconstruction, applying 3D Gaussian in head avatar creation remains a major challenge since it is difficult for 3D Gaussian to model the head shape variations caused by changing poses and expressions. In this paper, we introduce PSAvatar, a novel framework for animatable head avatar creation that utilizes discrete geometric primitive to create a parametric morphable shape model and employs 3D Gaussian for fine detail representation and high fidelity rendering. The parametric morphable shape model is a Point-based Morphable Shape Model (PMSM) which uses points instead of meshes for 3D representation to achieve enhanced representation flexibility. The PMSM first converts the FLAME mesh to points by sampling on the surfaces as well as off the meshes to enable the reconstruction of not only surface-like structures but also complex geometries such as eyeglasses and hairstyles. By aligning these points with the head shape in an analysis-by-synthesis manner, the PMSM makes it possible to utilize 3D Gaussian for fine detail representation and appearance modeling, thus enabling the creation of high-fidelity avatars. We show that PSAvatar can reconstruct high-fidelity head avatars of a variety of subjects and the avatars can be animated in real-time ($\ge$ 25 fps at a resolution of 512 $\times$ 512 ).
翻译:尽管已取得诸多进展,实现实时高保真头部虚拟形象动画仍然困难,现有方法不得不在速度与质量之间进行权衡。基于3DMM的方法通常难以建模非面部结构(如眼镜和发型),而神经隐式模型则受限于形变灵活性不足和渲染效率低下。虽然3D高斯已被证明在几何表示和辐射场重建方面具有良好潜力,但将其应用于头部虚拟形象创建仍面临重大挑战,因为3D高斯难以建模由姿态和表情变化引起的头部形状变化。本文提出PSAvatar,一种用于可动画头部虚拟形象创建的新型框架,该框架利用离散几何基元构建参数化可变形形状模型,并采用3D高斯进行细节表示和高保真渲染。参数化可变形形状模型是一种基于点的可变形形状模型(PMSM),其使用点而非网格进行三维表示,以实现更强的表示灵活性。PMSM首先通过在FLAME网格表面及外部采样,将网格转换为点云,从而能够重建不仅包括类表面结构,还能重建如眼镜和发型等复杂几何体。通过以分析-合成方式将这些点与头部形状对齐,PMSM使得利用3D高斯进行细节表示和外观建模成为可能,从而能够创建高保真虚拟形象。实验表明,PSAvatar能够重建多种对象的高保真头部虚拟形象,并支持实时动画(在512×512分辨率下帧率≥25 fps)。