Though denoising diffusion probabilistic models (DDPMs) have achieved remarkable generation results, the low sampling efficiency of DDPMs still limits further applications. Since DDPMs can be formulated as diffusion ordinary differential equations (ODEs), various fast sampling methods can be derived from solving diffusion ODEs. However, we notice that previous sampling methods with fixed analytical form are not robust with the error in the noise estimated from pretrained diffusion models. In this work, we construct an error-robust Adams solver (ERA-Solver), which utilizes the implicit Adams numerical method that consists of a predictor and a corrector. Different from the traditional predictor based on explicit Adams methods, we leverage a Lagrange interpolation function as the predictor, which is further enhanced with an error-robust strategy to adaptively select the Lagrange bases with lower error in the estimated noise. Experiments on Cifar10, LSUN-Church, and LSUN-Bedroom datasets demonstrate that our proposed ERA-Solver achieves 5.14, 9.42, and 9.69 Fenchel Inception Distance (FID) for image generation, with only 10 network evaluations.
翻译:尽管去噪扩散概率模型(DDPMs)已取得了显著的生成效果,但其低采样效率仍限制了进一步应用。由于DDPMs可建模为扩散常微分方程(ODEs),从求解扩散ODEs出发可推导出多种快速采样方法。然而,我们注意到先前具有固定解析形式的采样方法对预训练扩散模型估计噪声中的误差不够鲁棒。本研究构建了一种误差鲁棒的Adams求解器(ERA-Solver),该求解器采用由预测器和校正器组成的隐式Adams数值方法。不同于基于显式Adams方法的传统预测器,我们利用拉格朗日插值函数作为预测器,并进一步结合误差鲁棒策略自适应选择估计噪声中误差更低的拉格朗日基。在Cifar10、LSUN-Church和LSUN-Bedroom数据集上的实验表明,所提出的ERA-Solver在仅需10次网络评估的条件下,图像生成的FID分数分别达到5.14、9.42和9.69。