Connecting optimal transport and variational inference, we present a principled and systematic framework for sampling and generative modelling centred around divergences on path space. Our work culminates in the development of the \emph{Controlled Monte Carlo Diffusion} sampler (CMCD) for Bayesian computation, a score-based annealing technique that crucially adapts both forward and backward dynamics in a diffusion model. On the way, we clarify the relationship between the EM-algorithm and iterative proportional fitting (IPF) for Schr{\"o}dinger bridges, deriving as well a regularised objective that bypasses the iterative bottleneck of standard IPF-updates. Finally, we show that CMCD has a strong foundation in the Jarzinsky and Crooks identities from statistical physics, and that it convincingly outperforms competing approaches across a wide array of experiments.
翻译:连接最优运输与变分推断,我们围绕路径空间上的散度,提出了一套用于采样与生成建模的原则性且系统化的框架。我们的工作最终发展为用于贝叶斯计算的受控蒙特卡洛扩散采样器(CMCD),这是一种基于得分的退火技术,关键性地同时调整了扩散模型中的前向与后向动力学。在此过程中,我们阐明了用于薛定谔桥的EM算法与迭代比例拟合(IPF)之间的关系,并推导出一个正则化目标,该目标绕过了标准IPF更新的迭代瓶颈。最后,我们证明CMCD在统计物理中的Jarzinsky和Crooks恒等式具有坚实基础,并且在一系列广泛的实验中,它令人信服地优于其他竞争方法。