Recent ODE/SDE-based generative models, such as diffusion models, rectified flows, and flow matching, define a generative process as a time reversal of a fixed forward process. Even though these models show impressive performance on large-scale datasets, numerical simulation requires multiple evaluations of a neural network, leading to a slow sampling speed. We attribute the reason to the high curvature of the learned generative trajectories, as it is directly related to the truncation error of a numerical solver. Based on the relationship between the forward process and the curvature, here we present an efficient method of training the forward process to minimize the curvature of generative trajectories without any ODE/SDE simulation. Experiments show that our method achieves a lower curvature than previous models and, therefore, decreased sampling costs while maintaining competitive performance. Code is available at https://github.com/sangyun884/fast-ode.
翻译:近期基于ODE/SDE的生成模型,如扩散模型、整流流和流匹配,将生成过程定义为固定前向过程的时间反转。尽管这些模型在大规模数据集上表现出色,但数值模拟需要对神经网络进行多次评估,导致采样速度缓慢。我们将原因归咎于学习到的生成轨迹的高曲率,因为它直接关系到数值求解器的截断误差。基于前向过程与曲率之间的关系,本文提出了一种高效训练前向过程的方法,无需任何ODE/SDE模拟即可最小化生成轨迹的曲率。实验表明,我们的方法相比先前模型实现了更低的曲率,从而在保持竞争力的同时降低了采样成本。代码见https://github.com/sangyun884/fast-ode。