Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models

Diffusion models are powerful, but they require a lot of time and data to train. We propose Patch Diffusion, a generic patch-wise training framework, to significantly reduce the training time costs while improving data efficiency, which thus helps democratize diffusion model training to broader users. At the core of our innovations is a new conditional score function at the patch level, where the patch location in the original image is included as additional coordinate channels, while the patch size is randomized and diversified throughout training to encode the cross-region dependency at multiple scales. Sampling with our method is as easy as in the original diffusion model. Through Patch Diffusion, we could achieve $\mathbf{\ge 2\times}$ faster training, while maintaining comparable or better generation quality. Patch Diffusion meanwhile improves the performance of diffusion models trained on relatively small datasets, $e.g.$, as few as 5,000 images to train from scratch. We achieve state-of-the-art FID scores 1.77 on CelebA-64$\times$64 and 1.93 on AFHQv2-Wild-64$\times$64. We will share our code and pre-trained models soon.

翻译：扩散模型功能强大，但其训练需要大量时间和数据。我们提出补丁扩散（Patch Diffusion）——一种通用的补丁级训练框架，可显著减少训练时间成本并提高数据效率，从而有助于将扩散模型训练普及至更广泛的用户群体。我们创新的核心在于提出了一种新的补丁级条件分数函数，该函数将原始图像中的补丁位置作为附加坐标通道加入，同时训练过程中对补丁尺寸进行随机化和多样化处理，以编码多尺度的跨区域依赖关系。我们的采样方法与原始扩散模型同样简单易行。通过补丁扩散，我们可实现$\mathbf{\ge 2\times}$倍的训练加速，同时保持可比或更优的生成质量。补丁扩散还能提升在较小数据集（例如仅需从5000张图像开始训练）上训练的扩散模型性能。我们在CelebA-64$\times$64上取得了1.77的FID分数，在AFHQv2-Wild-64$\times$64上取得了1.93的FID分数，均达到了业界最佳水平。我们很快将公开代码和预训练模型。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日