Flow Matching Calibration for Simulation-Based Inference under Model Misspecification

Simulation-based inference (SBI) is transforming experimental sciences by enabling parameter estimation in complex non-linear models from simulated data. A persistent challenge, however, is model misspecification. In a Bayesian setting, targeting posterior distributions, errors may arise from the simulator, the noise or prior modelling. These model components are only approximations of reality, and severe mismatches can yield biased or overconfident posteriors. We address this issue by introducing Flow Matching Corrected Posterior Estimation (FMCPE), a framework that leverages the flow matching paradigm to refine simulation-trained posterior estimators using a small set of calibration samples. Our approach proceeds in two stages: first, a posterior approximator is trained on abundant simulated data; second, flow matching transports its predictions toward the true posterior supported by calibration observations. We rely on the later to guide the correction, without requiring explicit knowledge of the misspecification form or of which model components are affected. This design enables FMCPE to combine the scalability of SBI with robustness to distributional shift. Across synthetic benchmarks and real-world datasets, we show that our proposal consistently mitigates the effects of misspecification, delivering improved inference accuracy and uncertainty quantification compared to standard SBI baselines, while remaining computationally efficient.

翻译：仿真推断（SBI）通过模拟数据实现复杂非线性模型中的参数估计，正在变革实验科学领域。然而，模型设定偏误始终是其面临的核心挑战。在贝叶斯框架下进行后验分布推断时，误差可能来源于仿真器、噪声建模或先验建模。这些模型组分仅是现实情况的近似表达，严重的模型失配会导致有偏或过度自信的后验估计。针对该问题，我们提出流匹配校正后验估计（FMCPE）框架，该框架利用流匹配范式通过少量校准样本对基于仿真的后验估计器进行精化。本方法分两阶段进行：首先，在大量模拟数据上训练后验近似器；其次，通过流匹配将其预测结果向校准观测数据支撑的真实后验分布迁移。我们依赖校准数据引导修正过程，无需显式了解模型设定偏误的形式或受影响的模型组分。该设计使FMCPE兼具SBI的可扩展性与对分布偏移的鲁棒性。合成基准测试与真实数据集实验表明，与标准SBI基线方法相比，本方法能持续缓解模型设定偏误的影响，在保持计算效率的同时，显著提升推断精度与不确定性量化质量。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

大模型错因诊断分析

专知会员服务

9+阅读 · 4月9日

【ETHZ博士论文】《结构化数据的概率模型与近似推断方法》

专知会员服务

29+阅读 · 2024年11月23日

基于因果推断的推荐系统去偏研究

专知会员服务

21+阅读 · 2024年11月10日

【剑桥大学博士论文】深度贝叶斯模型改进的变分推断方法，226页pdf

专知会员服务

49+阅读 · 2024年1月13日