The conventional understanding of adversarial training in generative adversarial networks (GANs) is that the discriminator is trained to estimate a divergence, and the generator learns to minimize this divergence. We argue that despite the fact that many variants of GANs were developed following this paradigm, the current theoretical understanding of GANs and their practical algorithms are inconsistent. In this paper, we leverage Wasserstein gradient flows which characterize the evolution of particles in the sample space, to gain theoretical insights and algorithmic inspiration of GANs. We introduce a unified generative modeling framework - MonoFlow: the particle evolution is rescaled via a monotonically increasing mapping of the log density ratio. Under our framework, adversarial training can be viewed as a procedure first obtaining MonoFlow's vector field via training the discriminator and the generator learns to draw the particle flow defined by the corresponding vector field. We also reveal the fundamental difference between variational divergence minimization and adversarial training. This analysis helps us to identify what types of generator loss functions can lead to the successful training of GANs and suggest that GANs may have more loss designs beyond the literature (e.g., non-saturated loss), as long as they realize MonoFlow. Consistent empirical studies are included to validate the effectiveness of our framework.
翻译:传统上对生成对抗网络(GANs)中对抗训练的理解是:判别器被训练以估计某种散度,而生成器则学习最小化该散度。尽管许多GAN变体都遵循这一范式发展,我们指出当前GAN的理论理解与实际算法之间存在不一致性。本文利用Wasserstein梯度流(刻画样本空间中粒子演化规律的理论工具)来获得GAN的理论洞见与算法启示。我们提出统一生成建模框架——MonoFlow:通过log密度比的单调递增映射对粒子演化进行重新缩放。在该框架下,对抗训练可视为一个过程:首先通过训练判别器获取MonoFlow的向量场,然后生成器学习绘制由对应向量场所定义的粒子流。我们还揭示了变分散度最小化与对抗训练之间的根本差异。该分析有助于识别哪些类型的生成器损失函数能实现GAN的成功训练,并表明只要能够实现MonoFlow,GAN可能存在超越现有文献(如非饱和损失)的更多损失函数设计方案。我们通过一致的实验研究验证了该框架的有效性。