Does Diffusion Beat GAN in Image Super Resolution?

There is a prevalent opinion in the recent literature that Diffusion-based models outperform GAN-based counterparts on the Image Super Resolution (ISR) problem. However, in most studies, Diffusion-based ISR models were trained longer and utilized larger networks than the GAN baselines. This raises the question of whether the superiority of Diffusion models is due to the Diffusion paradigm being better suited for the ISR task or if it is a consequence of the increased scale and computational resources used in contemporary studies. In our work, we compare Diffusion-based and GAN-based Super Resolution under controlled settings, where both approaches are matched in terms of architecture, model and dataset size, and computational budget. We show that a GAN-based model can achieve results comparable to a Diffusion-based model. Additionally, we explore the impact of design choices such as text conditioning and augmentation on the performance of ISR models, showcasing their effect on several downstream tasks. We will release the inference code and weights of our scaled GAN.

翻译：近期文献中存在一种普遍观点，认为基于扩散的模型在图像超分辨率任务上表现优于基于生成对抗网络的模型。然而，在大多数研究中，基于扩散的图像超分辨率模型的训练时间更长，且使用的网络规模大于生成对抗网络基线模型。这引发了一个问题：扩散模型的优越性究竟是源于扩散范式本身更适用于图像超分辨率任务，还是当代研究中增加的模型规模与计算资源所导致的结果？在本研究中，我们在受控设置下比较了基于扩散与基于生成对抗网络的超分辨率方法，确保两种方法在架构、模型与数据集规模以及计算预算方面均保持一致。我们证明基于生成对抗网络的模型能够取得与基于扩散的模型相当的结果。此外，我们探究了文本条件化与数据增强等设计选择对图像超分辨率模型性能的影响，并展示了这些选择在多个下游任务中的作用。我们将公开所提出的规模化生成对抗网络的推理代码与权重。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日