SODA: Bottleneck Diffusion Models for Representation Learning

We introduce SODA, a self-supervised diffusion model, designed for representation learning. The model incorporates an image encoder, which distills a source view into a compact representation, that, in turn, guides the generation of related novel views. We show that by imposing a tight bottleneck between the encoder and a denoising decoder, and leveraging novel view synthesis as a self-supervised objective, we can turn diffusion models into strong representation learners, capable of capturing visual semantics in an unsupervised manner. To the best of our knowledge, SODA is the first diffusion model to succeed at ImageNet linear-probe classification, and, at the same time, it accomplishes reconstruction, editing and synthesis tasks across a wide range of datasets. Further investigation reveals the disentangled nature of its emergent latent space, that serves as an effective interface to control and manipulate the model's produced images. All in all, we aim to shed light on the exciting and promising potential of diffusion models, not only for image generation, but also for learning rich and robust representations.

翻译：我们提出SODA，一种用于表示学习的自监督扩散模型。该模型包含一个图像编码器，将源视图压缩为紧凑表示，进而指导相关新颖视图的生成。研究表明，通过在编码器与去噪解码器之间施加严格瓶颈，并利用新颖视图合成作为自监督目标，可以将扩散模型转化为强大的表示学习器，以无监督方式捕捉视觉语义。据我们所知，SODA是首个在ImageNet线性探测分类任务上取得成功的扩散模型，同时能在多种数据集上完成重建、编辑和合成任务。进一步探究揭示其涌现的潜在空间具有解耦特性，该空间可作为有效接口来控制与操纵模型生成的图像。总而言之，我们旨在揭示扩散模型不仅限于图像生成，更在学习丰富稳健表示方面令人振奋且充满前景的潜力。

相关内容

SODA

关注 0

本专题讨论会主要讨论离散问题之有效演算法与资料结构。除了这些方法和结构的设计，还包括它们的使用、性能分析以及与它们的发展或局限性相关的数学问题。性能分析可以是分析性的，也可以是实验性的，可以是针对最坏情况或预期情况的性能。研究可以是理论性的，也可以是基于实践中出现的数据集，可以解决绩效分析中涉及的方法学问题。官网链接：https://www.siam.org/conferences/cm/conference/soda20

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日