HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images

As for human avatar reconstruction, contemporary techniques commonly necessitate the acquisition of costly data and struggle to achieve satisfactory results from a small number of casual images. In this paper, we investigate this task from a few-shot unconstrained photo album. The reconstruction of human avatars from such data sources is challenging because of limited data amount and dynamic articulated poses. For handling dynamic data, we integrate a skinning mechanism with deep marching tetrahedra (DMTet) to form a drivable tetrahedral representation, which drives arbitrary mesh topologies generated by the DMTet for the adaptation of unconstrained images. To effectively mine instructive information from few-shot data, we devise a two-phase optimization method with few-shot reference and few-shot guidance. The former focuses on aligning avatar identity with reference images, while the latter aims to generate plausible appearances for unseen regions. Overall, our framework, called HaveFun, can undertake avatar reconstruction, rendering, and animation. Extensive experiments on our developed benchmarks demonstrate that HaveFun exhibits substantially superior performance in reconstructing the human body and hand. Project website: https://seanchenxy.github.io/HaveFunWeb/.

翻译：针对人体虚拟化身重建，现有技术通常需要获取昂贵的数据，且难以从少量日常图像中获得令人满意的结果。本文研究从少量无约束相册中完成该任务。由于数据量有限且存在动态关节姿态，从此类数据源重建人体虚拟化身极具挑战性。为处理动态数据，我们将蒙皮机制与深度行进四面体（DMTet）相结合，形成可驱动的四面体表示，从而驱动DMTet生成的任意网格拓扑以适应无约束图像。为有效挖掘少量数据中的信息性内容，我们设计了一种包含少量参考与少量引导的两阶段优化方法：前者侧重对齐虚拟化身的身份与参考图像，后者旨在为未观测区域生成合理外观。总体而言，我们的框架名为HaveFun，可承担虚拟化身重建、渲染与动画任务。在我们开发基准上的大量实验表明，HaveFun在人体与手部重建方面表现出显著优越的性能。项目网站：https://seanchenxy.github.io/HaveFunWeb/。

相关内容

小样本学习

关注 216

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日