Few-shot Image Generation with Diffusion Models

Denoising diffusion probabilistic models (DDPMs) have been proven capable of synthesizing high-quality images with remarkable diversity when trained on large amounts of data. However, to our knowledge, few-shot image generation tasks have yet to be studied with DDPM-based approaches. Modern approaches are mainly built on Generative Adversarial Networks (GANs) and adapt models pre-trained on large source domains to target domains using a few available samples. In this paper, we make the first attempt to study when do DDPMs overfit and suffer severe diversity degradation as training data become scarce. Then we fine-tune DDPMs pre-trained on large source domains to solve the overfitting problem when training data is limited. Although the directly fine-tuned models accelerate convergence and improve generation quality and diversity compared with training from scratch, they still fail to retain some diverse features and can only produce coarse images. Therefore, we design a DDPM pairwise adaptation (DDPM-PA) approach to optimize few-shot DDPM domain adaptation. DDPM-PA efficiently preserves information learned from source domains by keeping the relative pairwise distances between generated samples during adaptation. Besides, DDPM-PA enhances the learning of high-frequency details from source models and limited training data. DDPM-PA further improves generation quality and diversity and achieves results better than current state-of-the-art GAN-based approaches. We demonstrate the effectiveness of our approach on a series of few-shot image generation tasks qualitatively and quantitatively.

翻译：去噪扩散概率模型（DDPMs）已被证明在大量数据训练时能够合成具有显著多样性的高质量图像。然而，据我们所知，目前尚未有基于DDPM的方法研究少样本图像生成任务。现代方法主要建立在生成对抗网络（GANs）基础上，通过利用少量可用样本将预训练于大型源域的模型适配到目标域。本文首次研究了DDPMs在训练数据稀缺时何时会出现过拟合及严重多样性退化的问题。随后，我们对预训练于大型源域的DDPMs进行微调，以解决训练数据有限时的过拟合问题。尽管直接微调后的模型与从头训练相比能加速收敛并提升生成质量与多样性，但仍无法保留某些多样性特征，且仅能生成粗糙图像。为此，我们提出一种DDPM成对适配（DDPM-PA）方法，以优化少样本DDPM域适配。DDPM-PA通过保持适配过程中生成样本间的相对成对距离，有效保留从源域学到的信息。此外，DDPM-PA增强了对源模型及有限训练数据中高频细节的学习。DDPM-PA进一步提升了生成质量与多样性，并取得了优于当前最先进GAN方法的结果。我们通过一系列少样本图像生成任务定性与定量地验证了该方法有效性。

相关内容

小样本学习

关注 216

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【CVPR2020】通过自适应GANs生成不同的图像，Diverse Image Generation via Self-Conditioned GANs

专知会员服务

34+阅读 · 2020年6月19日