Efficient Sampling of Stochastic Differential Equations with Positive Semi-Definite Models

This paper deals with the problem of efficient sampling from a stochastic differential equation, given the drift function and the diffusion matrix. The proposed approach leverages a recent model for probabilities \citep{rudi2021psd} (the positive semi-definite -- PSD model) from which it is possible to obtain independent and identically distributed (i.i.d.) samples at precision $\varepsilon$ with a cost that is $m^2 d \log(1/\varepsilon)$ where $m$ is the dimension of the model, $d$ the dimension of the space. The proposed approach consists in: first, computing the PSD model that satisfies the Fokker-Planck equation (or its fractional variant) associated with the SDE, up to error $\varepsilon$, and then sampling from the resulting PSD model. Assuming some regularity of the Fokker-Planck solution (i.e. $\beta$-times differentiability plus some geometric condition on its zeros) We obtain an algorithm that: (a) in the preparatory phase obtains a PSD model with L2 distance $\varepsilon$ from the solution of the equation, with a model of dimension $m = \varepsilon^{-(d+1)/(\beta-2s)} (\log(1/\varepsilon))^{d+1}$ where $0<s\leq1$ is the fractional power to the Laplacian, and total computational complexity of $O(m^{3.5} \log(1/\varepsilon))$ and then (b) for Fokker-Planck equation, it is able to produce i.i.d.\ samples with error $\varepsilon$ in Wasserstein-1 distance, with a cost that is $O(d \varepsilon^{-2(d+1)/\beta-2} \log(1/\varepsilon)^{2d+3})$ per sample. This means that, if the probability associated with the SDE is somewhat regular, i.e. $\beta \geq 4d+2$, then the algorithm requires $O(\varepsilon^{-0.88} \log(1/\varepsilon)^{4.5d})$ in the preparatory phase, and $O(\varepsilon^{-1/2}\log(1/\varepsilon)^{2d+2})$ for each sample. Our results suggest that as the true solution gets smoother, we can circumvent the curse of dimensionality without requiring any sort of convexity.

翻译：本文针对给定漂移函数和扩散矩阵的随机微分方程（SDE）高效采样问题展开研究。所提出的方法利用近期提出的概率模型 \citep{rudi2021psd}（正半定——PSD模型），该模型能以精度 $\varepsilon$ 获得独立同分布（i.i.d.）样本，计算代价为 $m^2 d \log(1/\varepsilon)$，其中 $m$ 为模型维度，$d$ 为空间维度。本文方法包括：首先，计算满足SDE对应的Fokker-Planck方程（或其分数阶变体）的PSD模型，误差控制在 $\varepsilon$ 以内；然后，从所得PSD模型中采样。在假设Fokker-Planck解具有正则性（即 $\beta$ 次可微性及零点满足某些几何条件）的前提下，我们得到一种算法：(a) 在准备阶段，获得与方程解在L2距离上为 $\varepsilon$ 的PSD模型，模型维度为 $m = \varepsilon^{-(d+1)/(\beta-2s)} (\log(1/\varepsilon))^{d+1}$，其中 $0 < s \leq 1$ 为拉普拉斯算子的分数阶幂，总计算复杂度为 $O(m^{3.5} \log(1/\varepsilon))$；(b) 对于Fokker-Planck方程，该算法能以Wasserstein-1距离上的误差 $\varepsilon$ 生成独立同分布样本，每个样本的计算代价为 $O(d \varepsilon^{-2(d+1)/\beta-2} \log(1/\varepsilon)^{2d+3})$。这意味着，若SDE对应的概率密度函数具有适当正则性（即 $\beta \geq 4d+2$），则算法在准备阶段的复杂度为 $O(\varepsilon^{-0.88} \log(1/\varepsilon)^{4.5d})$，每个样本的复杂度为 $O(\varepsilon^{-1/2}\log(1/\varepsilon)^{2d+2})$。我们的结果表明，随着真实解趋于光滑，可在无需任何凸性假设的前提下规避维度灾难。