A New Perspective On Denoising Based On Optimal Transport

In the standard formulation of the denoising problem, one is given a probabilistic model relating a latent variable $\Theta \in \Omega \subset \mathbb{R}^m \; (m\ge 1)$ and an observation $Z \in \mathbb{R}^d$ according to: $Z \mid \Theta \sim p(\cdot\mid \Theta)$ and $\Theta \sim G^*$, and the goal is to construct a map to recover the latent variable from the observation. The posterior mean, a natural candidate for estimating $\Theta$ from $Z$, attains the minimum Bayes risk (under the squared error loss) but at the expense of over-shrinking the $Z$, and in general may fail to capture the geometric features of the prior distribution $G^*$ (e.g., low dimensionality, discreteness, sparsity, etc.). To rectify these drawbacks, in this paper we take a new perspective on this denoising problem that is inspired by optimal transport (OT) theory and use it to propose a new OT-based denoiser at the population level setting. We rigorously prove that, under general assumptions on the model, our OT-based denoiser is well-defined and unique, and is closely connected to solutions to a Monge OT problem. We then prove that, under appropriate identifiability assumptions on the model, our OT-based denoiser can be recovered solely from information of the marginal distribution of $Z$ and the posterior mean of the model, after solving a linear relaxation problem over a suitable space of couplings that is reminiscent of a standard multimarginal OT (MOT) problem. In particular, thanks to Tweedie's formula, when the likelihood model $\{ p(\cdot \mid \theta) \}_{\theta \in \Omega}$ is an exponential family of distributions, the OT-based denoiser can be recovered solely from the marginal distribution of $Z$. In general, our family of OT-like relaxations is of interest in its own right and for the denoising problem suggests alternative numerical methods inspired by the rich literature on computational OT.

翻译：在标准去噪问题表述中，给定一个概率模型，该模型关联潜在变量 $\Theta \in \Omega \subset \mathbb{R}^m \; (m\ge 1)$ 与观测 $Z \in \mathbb{R}^d$，满足：$Z \mid \Theta \sim p(\cdot\mid \Theta)$ 且 $\Theta \sim G^*$，目标是构造一个映射以从观测中恢复潜在变量。后验均值作为从 $Z$ 估计 $\Theta$ 的自然候选，能够在平方误差损失下达到最小贝叶斯风险，但代价是对 $Z$ 过度收缩，且一般情况下可能无法捕捉先验分布 $G^*$ 的几何特征（例如低维性、离散性、稀疏性等）。为解决这些缺陷，本文受最优传输理论的启发，对去噪问题提出新视角，并在总体水平上提出一种基于最优传输的新去噪器。我们严格证明，在模型的一般假设下，该基于最优传输的去噪器定义良好且唯一，并与蒙日最优传输问题的解紧密相关。进一步证明，在模型的适当可识别性假设下，通过求解一个在合适耦合空间上的线性松弛问题（类似于标准多边缘最优传输问题），该基于最优传输的去噪器可仅从 $Z$ 的边缘分布和模型的后验均值信息中恢复。特别地，得益于特威迪公式，当似然模型 $\{ p(\cdot \mid \theta) \}_{\theta \in \Omega}$ 为指数分布族时，该基于最优传输的去噪器可仅从 $Z$ 的边缘分布中恢复。总体上，我们这类最优传输型松弛问题本身具有研究价值，并为去噪问题提供了受计算最优传输丰富文献启发的替代数值方法。