Consider an empirical measure $\mathbb{P}_n$ induced by $n$ iid samples from a $d$-dimensional $K$-subgaussian distribution $\mathbb{P}$ and let $\gamma = \mathcal{N}(0,\sigma^2 I_d)$ be the isotropic Gaussian measure. We study the speed of convergence of the smoothed Wasserstein distance $W_2(\mathbb{P}_n * \gamma, \mathbb{P}*\gamma) = n^{-\alpha + o(1)}$ with $*$ being the convolution of measures. For $K<\sigma$ and in any dimension $d\ge 1$ we show that $\alpha = {1\over2}$. For $K>\sigma$ in dimension $d=1$ we show that the rate is slower and is given by $\alpha = {(\sigma^2 + K^2)^2\over 4 (\sigma^4 + K^4)} < 1/2$. This resolves several open problems in \cite{goldfeld2020convergence}, and in particular precisely identifies the amount of smoothing $\sigma$ needed to obtain a parametric rate. In addition, we also establish that $D_{KL}(\mathbb{P}_n * \gamma \|\mathbb{P}*\gamma)$ has rate $O(1/n)$ for $K<\sigma$ but only slows down to $O({(\log n)^{d+1}\over n})$ for $K>\sigma$. The surprising difference of the behavior of $W_2^2$ and KL implies the failure of $T_{2}$-transportation inequality when $\sigma < K$. Consequently, the requirement $K<\sigma$ is necessary for validity of the log-Sobolev inequality (LSI) for the Gaussian mixture $\mathbb{P} * \mathcal{N}(0, \sigma^{2})$, closing an open problem in \cite{wang2016functional}, who established the LSI under precisely this condition.
翻译:考虑由来自d维K-次高斯分布$\mathbb{P}$的n个独立同分布样本所诱导的经验测度$\mathbb{P}_n$,并令$\gamma = \mathcal{N}(0,\sigma^2 I_d)$为各向同性高斯测度。我们研究平滑Wasserstein距离$W_2(\mathbb{P}_n * \gamma, \mathbb{P}*\gamma) = n^{-\alpha + o(1)}$的收敛速度,其中$*$表示测度的卷积。当$K<\sigma$且维度$d\ge 1$时,我们证明$\alpha = {1\over2}$。当$K>\sigma$且维度$d=1$时,我们证明收敛速率更慢,由$\alpha = {(\sigma^2 + K^2)^2\over 4 (\sigma^4 + K^4)} < 1/2$给出。这解决了文献\cite{goldfeld2020convergence}中的若干开放问题,特别是精确确定了为获得参数化速率所需的平滑量$\sigma$。此外,我们还证明了当$K<\sigma$时$D_{KL}(\mathbb{P}_n * \gamma \|\mathbb{P}*\gamma)$的速率为$O(1/n)$,而当$K>\sigma$时速率降至$O({(\log n)^{d+1}\over n})$。$W_2^2$与KL行为之间的显著差异表明当$\sigma < K$时$T_{2}$-运输不等式不成立。因此,条件$K<\sigma$对于高斯混合分布$\mathbb{P} * \mathcal{N}(0, \sigma^{2})$的对数Sobolev不等式(LSI)的有效性是必要的,这同时解决了文献\cite{wang2016functional}中提出的一个开放问题,该文献正是在此条件下建立了LSI。