Given an unnormalized probability density $\pi\propto\mathrm{e}^{-V}$, estimating its normalizing constant $Z=\int_{\mathbb{R}^d}\mathrm{e}^{-V(x)}\mathrm{d}x$ or free energy $F=-\log Z$ is a crucial problem in Bayesian statistics, statistical mechanics, and machine learning. It is challenging especially in high dimensions or when $\pi$ is multimodal. To mitigate the high variance of conventional importance sampling estimators, annealing-based methods such as Jarzynski equality and annealed importance sampling are commonly adopted, yet their quantitative complexity guarantees remain largely unexplored. We take a first step toward a non-asymptotic analysis of annealed importance sampling. In particular, we derive an oracle complexity of $\widetilde{O}\left(\frac{d\beta^2{\mathcal{A}}^2}{\varepsilon^4}\right)$ for estimating $Z$ within $\varepsilon$ relative error with high probability, where $\beta$ is the smoothness of $V$ and $\mathcal{A}$ denotes the action of a curve of probability measures interpolating $\pi$ and a tractable reference distribution. Our analysis, leveraging Girsanov theorem and optimal transport, does not explicitly require isoperimetric assumptions on the target distribution. Finally, to tackle the large action of the widely used geometric interpolation of probability distributions, we propose a new normalizing constant estimation algorithm based on reverse diffusion samplers and establish a framework for analyzing its complexity.
翻译:给定一个未归一化的概率密度 $\pi\propto\mathrm{e}^{-V}$,估计其归一化常数 $Z=\int_{\mathbb{R}^d}\mathrm{e}^{-V(x)}\mathrm{d}x$ 或自由能 $F=-\log Z$ 是贝叶斯统计、统计力学和机器学习中的一个关键问题。该问题在高维或 $\pi$ 为多峰分布时尤为困难。为缓解传统重要性采样估计量的高方差,通常采用基于退火的方法,如雅任斯基等式和退火重要性采样,然而其定量的复杂度保证在很大程度上仍未得到探索。我们首次对退火重要性采样进行了非渐近分析。具体而言,我们推导出以高概率在 $\varepsilon$ 相对误差内估计 $Z$ 的预言机复杂度为 $\widetilde{O}\left(\frac{d\beta^2{\mathcal{A}}^2}{\varepsilon^4}\right)$,其中 $\beta$ 是 $V$ 的光滑度,$\mathcal{A}$ 表示连接 $\pi$ 与一个易处理的参考分布的概率测度曲线的"作用量"。我们的分析利用了Girsanov定理和最优传输理论,并未明确要求目标分布满足等周假设。最后,为应对广泛使用的概率分布几何插值所产生的大作用量,我们提出了一种基于反向扩散采样器的新归一化常数估计算法,并建立了一个分析其复杂度的框架。