Uncertainty estimation is a key issue when considering the application of deep neural network methods in science and engineering. In this work, we introduce a novel algorithm that quantifies epistemic uncertainty via Monte Carlo sampling from a tempered posterior distribution. It combines the well established Metropolis Adjusted Langevin Algorithm (MALA) with momentum-based optimization using Adam and leverages a prolate proposal distribution, to efficiently draw from the posterior. We prove that the constructed chain admits the Gibbs posterior as an invariant distribution and converges to this Gibbs posterior in total variation distance. Furthermore, we demonstrate the efficiency of the resulting algorithm and the merit of the proposed changes on a state-of-the-art classifier from high-energy particle physics.
翻译:不确定性估计是考虑将深度神经网络方法应用于科学与工程领域时的关键问题。本文提出一种新颖算法,通过从退火后验分布进行蒙特卡洛采样来量化认知不确定性。该算法将成熟的Metropolis调整Langevin算法(MALA)与基于Adam的动量优化相结合,并利用长椭球状提议分布高效地从后验分布中采样。我们证明所构建的马尔可夫链以吉布斯后验为不变分布,并在全变差距离意义下收敛于该吉布斯后验。此外,我们在高能粒子物理领域的前沿分类器上验证了所提算法的效率及其改进措施的有效性。