The accuracy of face recognition systems has improved significantly in the past few years, thanks to the large amount of data collected and the advancement in neural network architectures. However, these large-scale datasets are often collected without explicit consent, raising ethical and privacy concerns. To address this, there have been proposals to use synthetic datasets for training face recognition models. Yet, such models still rely on real data to train the generative models and generally exhibit inferior performance compared to those trained on real datasets. One of these datasets, DigiFace, uses a graphics pipeline to generate different identities and different intra-class variations without using real data in training the models. However, the performance of this approach is poor on face recognition benchmarks, possibly due to the lack of realism in the images generated from the graphics pipeline. In this work, we introduce a novel framework for realism transfer aimed at enhancing the realism of synthetically generated face images. Our method leverages the large-scale face foundation model, and we adapt the pipeline for realism enhancement. By integrating the controllable aspects of the graphics pipeline with our realism enhancement technique, we generate a large amount of realistic variations-combining the advantages of both approaches. Our empirical evaluations demonstrate that models trained using our enhanced dataset significantly improve the performance of face recognition systems over the baseline. The source code and datasets will be made available publicly: https://www.idiap.ch/paper/digi2real
翻译:得益于海量数据的收集和神经网络架构的进步,人脸识别系统的准确性在过去几年中得到了显著提升。然而,这些大规模数据集通常未经明确同意而收集,引发了伦理和隐私方面的担忧。为解决这一问题,已有提议使用合成数据集来训练人脸识别模型。然而,此类模型仍需依赖真实数据来训练生成模型,并且其性能通常逊色于在真实数据集上训练的模型。其中,DigiFace数据集采用图形学管线生成不同身份及类内变化,而无需在模型训练中使用真实数据。但该方法在人脸识别基准测试中表现不佳,这可能是由于图形学管线生成的图像缺乏真实感。在本工作中,我们提出了一种新颖的真实感迁移框架,旨在增强合成生成的人脸图像的真实感。我们的方法利用大规模人脸基础模型,并调整了管线以实现真实感增强。通过将图形学管线的可控方面与我们的真实感增强技术相结合,我们生成了大量真实感强的变化图像,融合了两种方法的优势。我们的实证评估表明,使用我们增强后的数据集训练的模型,相较于基线,显著提升了人脸识别系统的性能。源代码和数据集将公开提供:https://www.idiap.ch/paper/digi2real