In recent years there has been increased interest in understanding the interplay between deep generative models (DGMs) and the manifold hypothesis. Research in this area focuses on understanding the reasons why commonly-used DGMs succeed or fail at learning distributions supported on unknown low-dimensional manifolds, as well as developing new models explicitly designed to account for manifold-supported data. This manifold lens provides both clarity as to why some DGMs (e.g. diffusion models and some generative adversarial networks) empirically surpass others (e.g. likelihood-based models such as variational autoencoders, normalizing flows, or energy-based models) at sample generation, and guidance for devising more performant DGMs. We carry out the first survey of DGMs viewed through this lens, making two novel contributions along the way. First, we formally establish that numerical instability of likelihoods in high ambient dimensions is unavoidable when modelling data with low intrinsic dimension. We then show that DGMs on learned representations of autoencoders can be interpreted as approximately minimizing Wasserstein distance: this result, which applies to latent diffusion models, helps justify their outstanding empirical results. The manifold lens provides a rich perspective from which to understand DGMs, and we aim to make this perspective more accessible and widespread.
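The claim that likelihoods become numerically unstable when the data's intrinsic dimension is below the ambient dimension can be illustrated with a minimal numerical sketch (not from the paper; the setup is an assumption for illustration). We fit a Gaussian by maximum likelihood to data lying near a one-dimensional line in R^3: as the off-manifold noise shrinks, the fitted covariance approaches singularity and the log-likelihood grows without bound.

```python
import numpy as np

rng = np.random.default_rng(0)


def mean_gaussian_loglik(eps: float, n: int = 1000) -> float:
    """Fit a Gaussian by MLE to points near a line in R^3 and
    return the mean log-likelihood of the training data.

    eps controls the off-manifold noise: smaller eps means the data
    is closer to a 1-D manifold embedded in 3-D ambient space."""
    # Points on a line through the origin, plus isotropic noise of scale eps.
    t = rng.normal(size=(n, 1))
    data = t * np.array([1.0, 2.0, -1.0]) + eps * rng.normal(size=(n, 3))

    # Maximum-likelihood Gaussian fit.
    mean = data.mean(axis=0)
    cov = np.cov(data, rowvar=False)

    # Mean Gaussian log-density over the training points.
    d = data.shape[1]
    diff = data - mean
    _, logdet = np.linalg.slogdet(cov)
    mahalanobis = np.einsum("ni,ij,nj->n", diff, np.linalg.inv(cov), diff)
    return float(np.mean(-0.5 * (d * np.log(2 * np.pi) + logdet + mahalanobis)))


if __name__ == "__main__":
    # As eps -> 0 the data concentrates on the line, two covariance
    # eigenvalues shrink toward zero, and the log-likelihood diverges.
    for eps in [1e-1, 1e-3, 1e-5]:
        print(f"eps={eps:g}  mean log-likelihood={mean_gaussian_loglik(eps):.2f}")
```

The log-likelihood increases without bound as the noise scale decreases, mirroring the paper's point: when the model's support has full ambient dimension but the data does not, likelihood values are not numerically stable quantities.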