PatchScaler: An Efficient Patch-Independent Diffusion Model for Super-Resolution

Diffusion models significantly improve the quality of super-resolved images with their impressive content generation capabilities. However, the huge computational costs limit the applications of these methods.Recent efforts have explored reasonable inference acceleration to reduce the number of sampling steps, but the computational cost remains high as each step is performed on the entire image.This paper introduces PatchScaler, a patch-independent diffusion-based single image super-resolution (SR) method, designed to enhance the efficiency of the inference process.The proposed method is motivated by the observation that not all the image patches within an image need the same sampling steps for reconstructing high-resolution images.Based on this observation, we thus develop a Patch-adaptive Group Sampling (PGS) to divide feature patches into different groups according to the patch-level reconstruction difficulty and dynamically assign an appropriate sampling configuration for each group so that the inference speed can be better accelerated.In addition, to improve the denoising ability at each step of the sampling, we develop a texture prompt to guide the estimations of the diffusion model by retrieving high-quality texture priors from a patch-independent reference texture memory.Experiments show that our PatchScaler achieves favorable performance in both quantitative and qualitative evaluations with fast inference speed.Our code and model are available at \url{https://github.com/yongliuy/PatchScaler}.

翻译：扩散模型凭借其卓越的内容生成能力，显著提升了超分辨率图像的质量。然而，巨大的计算成本限制了这些方法的应用。近期研究探索了合理的推理加速以减少采样步数，但由于每一步都在整张图像上执行，计算成本依然高昂。本文提出了PatchScaler，一种基于扩散的、与图像块无关的单图像超分辨率方法，旨在提升推理过程的效率。该方法的提出基于以下观察：图像中并非所有图像块都需要相同的采样步数来重建高分辨率图像。基于此观察，我们开发了一种块自适应分组采样方法，根据图像块级别的重建难度将特征块划分为不同组，并为每组动态分配合适的采样配置，从而更好地加速推理。此外，为了提升采样每一步的去噪能力，我们开发了一种纹理提示机制，通过从一个与图像块无关的参考纹理记忆中检索高质量纹理先验，来指导扩散模型的估计。实验表明，我们的PatchScaler在定量和定性评估中均取得了优越的性能，并具有快速的推理速度。我们的代码和模型可在 \url{https://github.com/yongliuy/PatchScaler} 获取。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/