Flow matching has emerged as a simulation-free alternative to diffusion-based generative modeling, producing samples by solving an ODE whose time-dependent velocity field is learned along an interpolation between a simple source distribution (e.g., a standard normal) and a target data distribution. Flow-based methods often exhibit greater training stability and have achieved strong empirical performance in high-dimensional settings where data concentrate near a low-dimensional manifold, such as text-to-image synthesis, video generation, and molecular structure generation. Despite this success, existing theoretical analyses of flow matching assume target distributions with smooth, full-dimensional densities, leaving its effectiveness in manifold-supported settings largely unexplained. To this end, we theoretically analyze flow matching with linear interpolation when the target distribution is supported on a smooth manifold. We establish a non-asymptotic convergence guarantee for the learned velocity field, and then propagate this estimation error through the ODE to obtain statistical consistency of the implicit density estimator induced by the flow-matching objective. The resulting convergence rate is near minimax-optimal, depends only on the intrinsic dimension, and reflects the smoothness of both the manifold and the target distribution. Together, these results provide a principled explanation for how flow matching adapts to intrinsic data geometry and circumvents the curse of dimensionality.
翻译:流匹配已成为基于扩散的生成建模的一种无模拟替代方法,通过求解一个依赖于时间的常微分方程(ODE)生成样本,该方程的速度场是在简单源分布(例如标准正态分布)与目标数据分布之间的插值过程中学习的。基于流的方法通常表现出更好的训练稳定性,并在数据集中于低维流形的高维场景中取得了强劲的实证表现,例如文本到图像合成、视频生成和分子结构生成。尽管取得了这些成功,现有关于流匹配的理论分析通常假设目标分布具有光滑的全维密度,未能解释其在流形支持设定下的有效性。为此,我们在线性插值下对目标分布支持在光滑流形上的流匹配进行了理论分析。我们为学习到的速度场建立了非渐近收敛保证,然后通过ODE传播该估计误差,以得到由流匹配目标诱导的隐式密度估计量的统计一致性。所得收敛率接近极小化最优,仅取决于内在维度,并反映了流形和目标分布的光滑程度。综合来看,这些结果为流匹配如何适应数据内在几何结构并规避维度灾难提供了原理性解释。