In few-shot image generation, directly training GAN models on just a handful of images faces the risk of overfitting. A popular solution is to transfer the models pretrained on large source domains to small target ones. In this work, we introduce WeditGAN, which realizes model transfer by editing the intermediate latent codes $w$ in StyleGANs with learned constant offsets ($\Delta w$), discovering and constructing target latent spaces via simply relocating the distribution of source latent spaces. The established one-to-one mapping between latent spaces can naturally prevents mode collapse and overfitting. Besides, we also propose variants of WeditGAN to further enhance the relocation process by regularizing the direction or finetuning the intensity of $\Delta w$. Experiments on a collection of widely used source/target datasets manifest the capability of WeditGAN in generating realistic and diverse images, which is simple yet highly effective in the research area of few-shot image generation.
翻译:在少样本图像生成任务中,直接基于少量图像训练生成对抗网络(GAN)模型存在过拟合风险。一种主流解决方案是将在大规模源域预训练的模型迁移至小规模目标域。本文提出WeditGAN,该方法通过学习恒定偏移量($\Delta w$)编辑StyleGAN中的中间隐编码$w$,实现模型迁移——即通过简单重定位源隐空间分布来发现并构建目标隐空间。该隐空间间建立的一一对应映射天然能防止模式崩塌与过拟合。此外,我们进一步提出WeditGAN的变体,通过正则化偏移方向或微调$\Delta w$强度来增强重定位过程。在多个广泛使用的源/目标数据集上的实验表明,WeditGAN能够生成逼真且多样化的图像。该方法在少样本图像生成研究领域中简洁高效。