With the development of virtualization and AI, real-time facial avatar animation is widely used in entertainment, office, business and other fields. Against this background, blendshapes have become a common industry animation solution because of their relative simplicity and ease of interpretation. Aiming for real-time performance and low computing resource dependence, we independently developed an accurate blendshape prediction system for low-power VR applications using a standard webcam. First, blendshape feature vectors are extracted through affine transformation and segmentation. Through further transformation and regression analysis, we were able to identify models for most blendshapes with significant predictive power. Post-processing was used to further improve response stability, including smoothing filtering and nonlinear transformations to minimize error. Experiments showed the system achieved accuracy similar to ARKit 6. Our model has low sensor/hardware requirements and realtime response with a consistent, accurate and smooth visual experience.
翻译:随着虚拟化和人工智能技术的发展,实时面部虚拟形象动画已广泛应用于娱乐、办公、商业等领域。在此背景下,混合形状因其相对简单且易于解释的特性,已成为行业通用的动画解决方案。为实现实时性能并降低对计算资源的依赖,我们基于标准网络摄像头独立开发了一套适用于低功耗VR应用的精确混合形状预测系统。首先通过仿射变换与分割提取混合形状特征向量;经过进一步变换与回归分析,我们为大多数混合形状建立了具有显著预测能力的模型。采用后处理技术进一步提升响应稳定性,包括平滑滤波与非线性变换以最小化误差。实验表明该系统达到了与ARKit 6相近的精度。我们的模型对传感器/硬件要求较低,具备实时响应能力,并能提供稳定、准确且流畅的视觉体验。