Traditional supervised denoisers are trained using pairs of noisy input and clean target images. They learn to predict a central tendency of the posterior distribution over possible clean images. When, e.g., trained with the popular quadratic loss function, the network's output will correspond to the minimum mean square error (MMSE) estimate. Unsupervised denoisers based on Variational AutoEncoders (VAEs) have succeeded in achieving state-of-the-art results while requiring only unpaired noisy data as training input. In contrast to the traditional supervised approach, unsupervised denoisers do not directly produce a single prediction, such as the MMSE estimate, but allow us to draw samples from the posterior distribution of clean solutions corresponding to the noisy input. To approximate the MMSE estimate during inference, unsupervised methods have to create and draw a large number of samples - a computationally expensive process - rendering the approach inapplicable in many situations. Here, we present an alternative approach that trains a deterministic network alongside the VAE to directly predict a central tendency. Our method achieves results that surpass the results achieved by the unsupervised method at a fraction of the computational cost.
翻译:传统监督去噪器通过使用成对的噪声输入和干净目标图像进行训练。它们学习预测干净图像后验分布的中心趋势。例如,当使用流行的二次损失函数训练时,网络的输出将对应最小均方误差(MMSE)估计。基于变分自编码器(VAE)的无监督去噪器在仅需未配对噪声数据作为训练输入的情况下,成功实现了最先进的结果。与传统监督方法不同,无监督去噪器不直接产生单一预测(如MMSE估计),而是允许我们从与噪声输入对应的干净解后验分布中采样。为在推断过程中近似MMSE估计,无监督方法必须生成并抽取大量样本——这是一个计算密集型过程——使得该方法在许多情况下不适用。在此,我们提出一种替代方案,在VAE旁训练一个确定性网络以直接预测中心趋势。我们的方法以极低的计算成本,取得了超越无监督方法的结果。