BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution

Diffusion models (DM) have achieved remarkable promise in image super-resolution (SR). However, most of them are tailored to solving non-blind inverse problems with fixed known degradation settings, limiting their adaptability to real-world applications that involve complex unknown degradations. In this work, we propose BlindDiff, a DM-based blind SR method to tackle the blind degradation settings in SISR. BlindDiff seamlessly integrates the MAP-based optimization into DMs, which constructs a joint distribution of the low-resolution (LR) observation, high-resolution (HR) data, and degradation kernels for the data and kernel priors, and solves the blind SR problem by unfolding MAP approach along with the reverse process. Unlike most DMs, BlindDiff firstly presents a modulated conditional transformer (MCFormer) that is pre-trained with noise and kernel constraints, further serving as a posterior sampler to provide both priors simultaneously. Then, we plug a simple yet effective kernel-aware gradient term between adjacent sampling iterations that guides the diffusion model to learn degradation consistency knowledge. This also enables to joint refine the degradation model as well as HR images by observing the previous denoised sample. With the MAP-based reverse diffusion process, we show that BlindDiff advocates alternate optimization for blur kernel estimation and HR image restoration in a mutual reinforcing manner. Experiments on both synthetic and real-world datasets show that BlindDiff achieves the state-of-the-art performance with significant model complexity reduction compared to recent DM-based methods. Code will be available at \url{https://github.com/lifengcs/BlindDiff}

翻译：扩散模型（DM）在图像超分辨率（SR）领域展现出显著潜力，但现有方法大多针对已知固定退化设置的非盲逆问题设计，难以适应包含复杂未知退化的实际应用。本文提出基于DM的盲SR方法BlindDiff，以应对SISR中的盲退化设置。BlindDiff将基于最大后验概率（MAP）的优化无缝融入扩散模型，通过构建低分辨率观测、高分辨率数据与退化核的联合分布来获取数据先验与核先验，并沿着反向过程展开MAP方法求解盲SR问题。与多数DM不同，BlindDiff首先提出一种调制条件Transformer（MCFormer）——该模型通过噪声与核约束进行预训练，可作为后验采样器同时提供两种先验。随后，我们在相邻采样迭代间嵌入简洁有效的核感知梯度项，引导扩散模型学习退化一致性知识，从而通过观测前一步去噪样本联合优化退化模型与高分辨率图像。基于MAP反向扩散过程，BlindDiff以相互强化的交替优化方式实现模糊核估计与高分辨率图像恢复。在合成与真实数据集上的实验表明，与近期基于DM的方法相比，BlindDiff在显著降低模型复杂度的同时取得了最先进性能。代码将发布于\url{https://github.com/lifengcs/BlindDiff}

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日