Neural Markov chain Monte Carlo: Bayesian inversion via normalizing flows and variational autoencoders

This paper introduces a Bayesian framework that combines Markov chain Monte Carlo (MCMC) sampling, dimensionality reduction, and neural density estimation to efficiently handle inverse problems that (i) must be solved multiple times, and (ii) are characterized by intractable or unavailable likelihood functions. The posterior probability distribution over quantities of interest is estimated via differential evolution Metropolis sampling, empowered by learnable mappings. First, a variational autoencoder performs probabilistic feature extraction from observational data. The resulting latent structure inherently quantifies uncertainty, capturing deviations between the actual data-generating process and the training data distribution. At each step of the MCMC random walk, the algorithm jointly samples from the data-informed latent distribution and the space of parameters to be inferred. These samples are fed into a neural likelihood estimator based on normalizing flows, specifically real-valued non-volume preserving transformations. The scaling and translation functions of the affine coupling layers are modeled by neural networks conditioned on the unknown parameters, allowing the representation of arbitrary observation likelihoods. The proposed methodology is validated on two case studies: (i) structural health monitoring of a railway bridge for damage detection, localization, and quantification, and (ii) estimation of the conductivity field in a steady-state Darcy's groundwater flow problem. The results demonstrate the efficiency of the inference strategy, while ensuring that model-reality mismatches do not yield overconfident, yet inaccurate, estimates.
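The two core ingredients named above, a conditional affine coupling layer and differential evolution Metropolis sampling, can be illustrated with a minimal sketch. It is not the authors' implementation, and everything in it is an assumption made for illustration: the latent feature vector is two-dimensional, the unknown parameter `theta` is a scalar with a flat prior on [-3, 3], the hypothetical `conditioner` function with fixed toy weights stands in for the trained scaling and translation networks, and the latent vector `z_obs` is held fixed rather than resampled from a variational autoencoder at each step.

```python
import math
import random


def conditioner(z1, theta):
    """Toy stand-in (assumed weights) for the networks giving log-scale s and shift t."""
    h = math.tanh(0.8 * z1 + 0.5 * theta)
    return 0.3 * h, 1.2 * h + 0.1 * theta


def forward(z, theta):
    """Map latent features z = (z1, z2) to the base space; return (u, log|det J|)."""
    z1, z2 = z
    s, t = conditioner(z1, theta)
    # z1 passes through unchanged; z2 is shifted and rescaled given z1 and theta.
    return (z1, (z2 - t) * math.exp(-s)), -s


def inverse(u, theta):
    """Invert the coupling layer: base-space point u back to latent features z."""
    u1, u2 = u
    s, t = conditioner(u1, theta)
    return (u1, u2 * math.exp(s) + t)


def log_likelihood(z, theta):
    """log p(z | theta) by change of variables with a standard normal base density."""
    u, log_det = forward(z, theta)
    log_base = sum(-0.5 * x * x - 0.5 * math.log(2.0 * math.pi) for x in u)
    return log_base + log_det


def de_metropolis(z_obs, n_chains=6, n_steps=500, lo=-3.0, hi=3.0, seed=0):
    """Differential evolution Metropolis over a scalar theta with a flat prior on [lo, hi]."""
    rng = random.Random(seed)
    gamma = 2.38 / math.sqrt(2.0)  # common DE-MC jump scale for a 1-D parameter
    chains = [rng.uniform(lo, hi) for _ in range(n_chains)]
    logp = [log_likelihood(z_obs, th) for th in chains]
    history = []
    for _ in range(n_steps):
        for i in range(n_chains):
            # Propose a jump along the difference of two other randomly chosen chains.
            a, b = rng.sample([j for j in range(n_chains) if j != i], 2)
            prop = chains[i] + gamma * (chains[a] - chains[b]) + rng.gauss(0.0, 1e-3)
            if lo <= prop <= hi:  # outside the prior support: always rejected
                lp = log_likelihood(z_obs, prop)
                if rng.random() < math.exp(min(0.0, lp - logp[i])):
                    chains[i], logp[i] = prop, lp
        history.append(list(chains))
    return history
```

Calling `de_metropolis((0.4, -1.1))` returns one list of chain states per step; after discarding an initial burn-in portion, the pooled states approximate the posterior over `theta` for this toy likelihood. In a setting closer to the one described in the abstract, `z_obs` would instead be drawn from the encoder's latent distribution for the observed data at every step.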


