Implicit generative modeling (IGM) aims to produce samples of synthetic data matching the characteristics of a target data distribution. Recent work (e.g., score-matching networks, diffusion models) has approached the IGM problem from the perspective of pushing synthetic source data toward the target distribution via dynamical perturbations or flows in the ambient space. In this direction, we present the score difference (SD) between arbitrary target and source distributions as a flow that optimally reduces the Kullback-Leibler divergence between them. We apply the SD flow to convenient proxy distributions, which are aligned if and only if the original distributions are aligned. We demonstrate the formal equivalence of this formulation to denoising diffusion models under certain conditions. We also show that the training of generative adversarial networks includes a hidden data-optimization sub-problem, which induces the SD flow under certain choices of loss function when the discriminator is optimal. As a result, the SD flow provides a theoretical link between model classes that individually address the three challenges of the "generative modeling trilemma" -- high sample quality, mode coverage, and fast sampling -- thereby setting the stage for a unified approach.
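For concreteness, here is a minimal sketch of the flow the abstract describes, in generic notation (the symbols $p$, $q_t$, and $\delta$ are ours and may differ from the paper's). Writing $p$ for the target density and $q_t$ for the evolving source density, the score difference is
\[
\delta(x) \;=\; \nabla_x \log p(x) \;-\; \nabla_x \log q_t(x),
\]
and transporting source samples along $\dot{x} = \delta(x)$ drives the Kullback-Leibler divergence down at the rate
\[
\frac{d}{dt}\, D_{\mathrm{KL}}(q_t \,\|\, p) \;=\; -\,\mathbb{E}_{q_t}\!\left[ \Big\| \nabla_x \log \tfrac{q_t(x)}{p(x)} \Big\|^2 \right] \;\le\; 0,
\]
the negative relative Fisher information, which is the steepest-descent rate for the KL divergence under the Wasserstein-2 geometry and is the standard sense in which such a flow is "optimal."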
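The GAN connection asserted above can be made concrete in one standard case (a sketch, assuming the usual cross-entropy discriminator; the notation is again ours). At optimality the discriminator satisfies $D^*(x) = p(x)/(p(x)+q(x))$, so its logit recovers the log-density ratio, and its input gradient recovers the score difference:
\[
\log \frac{D^*(x)}{1 - D^*(x)} \;=\; \log \frac{p(x)}{q(x)},
\qquad
\nabla_x \log \frac{D^*(x)}{1 - D^*(x)} \;=\; \nabla_x \log p(x) \;-\; \nabla_x \log q(x) \;=\; \delta(x).
\]
Ascending generated samples along the gradient of the optimal discriminator's logit therefore moves them along the score difference, which illustrates the sense in which a data-optimization sub-problem inside GAN training can induce the SD flow.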