Learning Image Priors through Patch-based Diffusion Models for Solving Inverse Problems

Diffusion models can learn strong image priors from underlying data distribution and use them to solve inverse problems, but the training process is computationally expensive and requires lots of data. Such bottlenecks prevent most existing works from being feasible for high-dimensional and high-resolution data such as 3D images. This paper proposes a method to learn an efficient data prior for the entire image by training diffusion models only on patches of images. Specifically, we propose a patch-based position-aware diffusion inverse solver, called PaDIS, where we obtain the score function of the whole image through scores of patches and their positional encoding and utilize this as the prior for solving inverse problems. First of all, we show that this diffusion model achieves an improved memory efficiency and data efficiency while still maintaining the capability to generate entire images via positional encoding. Additionally, the proposed PaDIS model is highly flexible and can be plugged in with different diffusion inverse solvers (DIS). We demonstrate that the proposed PaDIS approach enables solving various inverse problems in both natural and medical image domains, including CT reconstruction, deblurring, and superresolution, given only patch-based priors. Notably, PaDIS outperforms previous DIS methods trained on entire image priors in the case of limited training data, demonstrating the data efficiency of our proposed approach by learning patch-based prior.

翻译：扩散模型能够从底层数据分布中学习到强大的图像先验，并利用其解决逆问题，但训练过程计算成本高昂且需要大量数据。此类瓶颈使得现有大多数方法难以适用于高维和高分辨率数据（例如3D图像）。本文提出一种方法，仅通过在图像块上训练扩散模型，即可学习到针对整幅图像的高效数据先验。具体而言，我们提出一种基于块且具有位置感知的扩散逆求解器，称为PaDIS。在该方法中，我们通过图像块的分数及其位置编码来获得整幅图像的分数函数，并将其用作解决逆问题的先验。首先，我们证明该扩散模型在通过位置编码保持生成完整图像能力的同时，实现了更高的内存效率和数据效率。此外，所提出的PaDIS模型具有高度灵活性，可与不同的扩散逆求解器（DIS）结合使用。实验表明，在仅具备基于块的先验知识条件下，所提出的PaDIS方法能够解决自然图像和医学图像领域的多种逆问题，包括CT重建、去模糊和超分辨率。值得注意的是，在训练数据有限的情况下，PaDIS的性能优于以往基于完整图像先验训练的DIS方法，这证明了我们通过基于块的先验学习所实现的数据高效性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日