Assessing the capacity of a denoising diffusion probabilistic model to reproduce spatial context

Diffusion models have emerged as a popular family of deep generative models (DGMs). In the literature, it has been claimed that one class of diffusion models -- denoising diffusion probabilistic models (DDPMs) -- demonstrate superior image synthesis performance as compared to generative adversarial networks (GANs). To date, these claims have been evaluated using either ensemble-based methods designed for natural images, or conventional measures of image quality such as structural similarity. However, there remains an important need to understand the extent to which DDPMs can reliably learn medical imaging domain-relevant information, which is referred to as `spatial context' in this work. To address this, a systematic assessment of the ability of DDPMs to learn spatial context relevant to medical imaging applications is reported for the first time. A key aspect of the studies is the use of stochastic context models (SCMs) to produce training data. In this way, the ability of the DDPMs to reliably reproduce spatial context can be quantitatively assessed by use of post-hoc image analyses. Error-rates in DDPM-generated ensembles are reported, and compared to those corresponding to a modern GAN. The studies reveal new and important insights regarding the capacity of DDPMs to learn spatial context. Notably, the results demonstrate that DDPMs hold significant capacity for generating contextually correct images that are `interpolated' between training samples, which may benefit data-augmentation tasks in ways that GANs cannot.

翻译：扩散模型已成为一类流行的深度生成模型（DGM）。文献中声称，与生成对抗网络（GAN）相比，一类扩散模型——去噪扩散概率模型（DDPM）——展现出更优的图像合成性能。迄今为止，这些主张的评估要么采用为自然图像设计的集成方法，要么使用结构相似性等传统图像质量度量。然而，仍迫切需要理解DDPM在多大程度上能可靠学习医学成像领域相关信息——本文中称之为“空间上下文”。为此，首次系统评估了DDPM学习医学成像应用相关空间上下文的能力。研究的关键在于使用随机上下文模型（SCM）生成训练数据。通过这种方式，可借助事后图像分析定量评估DDPM可靠再现空间上下文的能力。报告了DDPM生成集成中的错误率，并与现代GAN对应的错误率进行了比较。研究揭示了关于DDPM学习空间上下文能力的新重要见解。值得注意的是，结果表明DDPM在生成“内插于”训练样本之间的上下文正确图像方面具有显著能力，这可能在数据增强任务中发挥GAN无法实现的优势。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【AI应用】Facebook-利用神经网络求解高等数学方程, Using neural networks to solve advanced mathematics equations

专知会员服务

34+阅读 · 2020年1月15日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日