SciRE-Solver: Efficient Sampling of Diffusion Probabilistic Models by Score-integrand Solver with Recursive Derivative Estimation

Diffusion probabilistic models (DPMs) are a powerful class of generative models known for their ability to generate high-fidelity image samples. A major challenge in the implementation of DPMs is the slow sampling process. In this work, we bring a high-efficiency sampler for DPMs. Specifically, we propose a score-based exact solution paradigm for the diffusion ODEs corresponding to the sampling process of DPMs, which introduces a new perspective on developing numerical algorithms for solving diffusion ODEs. To achieve an efficient sampler, we propose a recursive derivative estimation (RDE) method to reduce the estimation error. With our proposed solution paradigm and RDE method, we propose the score-integrand solver with the convergence order guarantee as efficient solver (SciRE-Solver) for solving diffusion ODEs. The SciRE-Solver attains state-of-the-art (SOTA) sampling performance with a limited number of score function evaluations (NFE) on both discrete-time and continuous-time DPMs in comparison to existing training-free sampling algorithms. Such as, we achieve $3.48$ FID with $12$ NFE and $2.42$ FID with $20$ NFE for continuous-time DPMs on CIFAR10, respectively. Different from other samplers, SciRE-Solver has the promising potential to surpass the FIDs achieved in the original papers of some pre-trained models with just fewer NFEs. For example, we reach SOTA value of $2.40$ FID with $100$ NFE for continuous-time DPM and of $3.15$ FID with $84$ NFE for discrete-time DPM on CIFAR-10, as well as of $2.17$ ($2.02$) FID with $18$ ($50$) NFE for discrete-time DPM on CelebA 64$\times$64.

翻译：扩散概率模型（DPMs）是一类强大的生成模型，以其生成高保真图像样本的能力而著称。DPMs实现中的主要挑战在于其缓慢的采样过程。本文提出了一种面向DPMs的高效采样器。具体而言，我们为对应DPMs采样过程的扩散常微分方程（ODE）提出了一种基于评分的精确解范式，为开发求解扩散ODE的数值算法提供了新视角。为实现高效采样器，我们提出了递归导数估计（RDE）方法来降低估计误差。基于所提出的解范式与RDE方法，我们设计了具有收敛阶保证的评分积分求解器（SciRE-Solver），用于高效求解扩散ODE。与现有无训练采样算法相比，SciRE-Solver在离散时间和连续时间DPMs上均能以有限评分函数评估次数（NFE）达到最先进的采样性能。例如，在CIFAR-10数据集上，对于连续时间DPMs，我们分别以12次NFE实现3.48的FID值，以20次NFE实现2.42的FID值。与其他采样器不同，SciRE-Solver具有仅需更少NFE即可超越部分预训练模型原始论文所获FID值的潜力。例如，在CIFAR-10数据集上，我们以100次NFE对连续时间DPM达到2.40的FID值，以84次NFE对离散时间DPM达到3.15的FID值；在CelebA 64×64数据集上，以18次（50次）NFE对离散时间DPM分别达到2.17（2.02）的FID值。