Multi-fidelity Parameter Estimation Using Conditional Diffusion Models

We present a multi-fidelity method for uncertainty quantification of parameter estimates in complex systems, leveraging generative models trained to sample the target conditional distribution. In the Bayesian inference setting, traditional parameter estimation methods rely on repeated simulations of potentially expensive forward models to determine the posterior distribution of the parameter values, which may result in computationally intractable workflows. Furthermore, methods such as Markov chain Monte Carlo (MCMC) require rerunning the entire algorithm for each new data observation, further increasing the computational burden. We therefore propose a novel method for efficiently obtaining posterior distributions of parameter estimates for high-fidelity models given data observations of interest. The method first constructs a low-fidelity, conditional generative model capable of amortized Bayesian inference and hence rapid posterior density approximation over a wide range of data observations. When higher accuracy is needed for a specific data observation, the method adaptively refines the density approximation: it uses outputs from the low-fidelity generative model to narrow the parameter sampling space, ensuring efficient use of the computationally expensive high-fidelity solver. Subsequently, a high-fidelity, unconditional generative model is trained to achieve greater accuracy in the target posterior distribution. Both the low- and high-fidelity generative models enable efficient sampling from the target posterior and do not require repeated simulation of the high-fidelity forward model. We demonstrate the effectiveness of the proposed method on several numerical examples, including cases with multi-modal densities, as well as an application in plasma physics for a runaway electron simulation model.
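
The three-stage workflow described above can be illustrated with a toy sketch. The abstract gives no implementation details, so everything below is an assumption made for illustration and is not the authors' algorithm: the forward models `f_lo` and `f_hi`, the noise level `SIGMA`, and the uniform prior on [-3, 3] are hypothetical, and the diffusion models are replaced by much simpler stand-ins (a nearest-neighbour conditional sampler for the low-fidelity stage, and importance weighting followed by a smoothed bootstrap for the high-fidelity stage).

```python
import numpy as np

rng = np.random.default_rng(0)
SIGMA = 0.1  # assumed observation noise standard deviation


def f_lo(theta):
    # Hypothetical cheap low-fidelity forward model.
    return theta


def f_hi(theta):
    # Hypothetical expensive high-fidelity forward model.
    return theta + 0.1 * theta**3


def build_lo_sampler(n_train=20000):
    # Stage 1 (stand-in for the conditional generative model):
    # simulate joint (theta, y) pairs once with the cheap model, then
    # answer any y_obs by returning the thetas whose simulated y is
    # closest to it. The training set is reused for every observation.
    theta = rng.uniform(-3.0, 3.0, n_train)
    y = f_lo(theta) + SIGMA * rng.normal(size=n_train)

    def sample(y_obs, k=200):
        idx = np.argsort(np.abs(y - y_obs))[:k]
        return theta[idx]

    return sample


def refine(lo_samples, y_obs, n_hi=500, margin=0.5):
    # Stage 2: use the low-fidelity posterior samples to shrink the
    # region in which the high-fidelity solver is evaluated.
    lo = max(lo_samples.min() - margin, -3.0)
    hi = min(lo_samples.max() + margin, 3.0)
    cand = rng.uniform(lo, hi, n_hi)
    resid = f_hi(cand) - y_obs  # the only high-fidelity solver calls
    logw = -0.5 * (resid / SIGMA) ** 2  # Gaussian log-likelihood
    w = np.exp(logw - logw.max())
    return cand, w / w.sum()


def build_hi_sampler(cand, w, bandwidth=0.02):
    # Stage 3 (stand-in for the unconditional generative model):
    # a smoothed bootstrap over the weighted candidates. Sampling
    # from it needs no further high-fidelity solver calls.
    def sample(n):
        idx = rng.choice(len(cand), size=n, p=w)
        return cand[idx] + bandwidth * rng.normal(size=n)

    return sample


lo_sampler = build_lo_sampler()
y_obs = f_hi(1.0)  # synthetic observation generated at theta = 1.0
lo_post = lo_sampler(y_obs)
cand, w = refine(lo_post, y_obs)
hi_sampler = build_hi_sampler(cand, w)
hi_post = hi_sampler(5000)
print("low-fidelity posterior mean: ", lo_post.mean())
print("high-fidelity posterior mean:", hi_post.mean())
```

In this toy setting the low-fidelity samples concentrate near the observed value, because `f_lo` ignores the cubic term, while the refined samples concentrate near the parameter that generated the observation. The high-fidelity solver is called only `n_hi` times, inside the region suggested by the low-fidelity stage.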

