Flow Map Matching - 专知论文

Generative models based on dynamical transport of measure, such as diffusion models, flow matching models, and stochastic interpolants, learn an ordinary or stochastic differential equation whose trajectories push initial conditions from a known base distribution onto the target. While training is cheap, samples are generated via simulation, which is more expensive than one-step models like GANs. To close this gap, we introduce flow map matching -- an algorithm that learns the two-time flow map of an underlying ordinary differential equation. The approach leads to an efficient few-step generative model whose step count can be chosen a-posteriori to smoothly trade off accuracy for computational expense. Leveraging the stochastic interpolant framework, we introduce losses for both direct training of flow maps and distillation from pre-trained (or otherwise known) velocity fields. Theoretically, we show that our approach unifies many existing few-step generative models, including consistency models, consistency trajectory models, progressive distillation, and neural operator approaches, which can be obtained as particular cases of our formalism. With experiments on CIFAR-10 and ImageNet 32x32, we show that flow map matching leads to high-quality samples with significantly reduced sampling cost compared to diffusion or stochastic interpolant methods.

翻译：基于测度动态输运的生成模型，如扩散模型、流匹配模型和随机插值模型，通过学习一个常微分方程或随机微分方程，其轨迹将初始条件从已知的基分布推送到目标分布。虽然训练成本低廉，但样本生成需要通过模拟进行，这比GAN等一步生成模型更为昂贵。为弥补这一差距，我们提出了流图匹配——一种学习底层常微分方程双时间流图的算法。该方法导出了一个高效的多步生成模型，其步数可在后验中选择，以平滑地权衡精度与计算开销。利用随机插值框架，我们引入了用于直接训练流图以及从预训练（或其他已知）速度场进行蒸馏的损失函数。理论上，我们证明了该方法统一了许多现有的多步生成模型，包括一致性模型、一致性轨迹模型、渐进蒸馏和神经算子方法，这些均可作为我们形式化框架的特例而得到。通过在CIFAR-10和ImageNet 32x32上的实验，我们表明流图匹配能够以显著低于扩散或随机插值方法的采样成本生成高质量样本。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日