Bayesian inference provides a powerful tool for leveraging observational data to inform model predictions and uncertainties. However, when such data is limited, Bayesian inference may not adequately constrain uncertainty without the use of highly informative priors. Common approaches for constructing informative priors typically rely on either assumptions or knowledge of the underlying physics, which may not be available in all scenarios. In this work, we consider the scenario where data are available on a population of assets/individuals, which occurs in many problem domains such as biomedical or digital twin applications, and leverage this population-level data to systematically constrain the Bayesian prior and subsequently improve individualized inferences. The approach proposed in this paper is based upon a recently developed technique known as data-consistent inversion (DCI) for constructing a pullback probability measure. Succinctly, we utilize DCI to build population-informed priors for subsequent Bayesian inference on individuals. While the approach is general and applies to nonlinear maps and arbitrary priors, we prove that for linear inverse problems with Gaussian priors, the population-informed prior produces an increase in the information gain as measured by the determinant and trace of the inverse posterior covariance. We also demonstrate that the Kullback-Leibler divergence often improves with high probability. Numerical results, including linear-Gaussian examples and one inspired by digital twins for additively manufactured assets, indicate that there is significant value in using these population-informed priors.
翻译:贝叶斯推断为利用观测数据指导模型预测与不确定性量化提供了强大工具。然而当数据有限时,若无强信息性先验,贝叶斯推断可能无法充分约束不确定性。构建信息性先验的常用方法通常依赖于对底层物理机制的假设或认知,但这在某些场景中可能无法实现。本研究针对存在资产/个体群体数据的场景(常见于生物医学或数字孪生等领域),利用群体层面的数据系统性地约束贝叶斯先验分布,进而改进个体化推断。本文提出的方法基于近期发展的数据一致反演技术,用于构建拉回概率测度。简言之,我们利用DCI构建群体信息先验,以支持后续针对个体的贝叶斯推断。虽然该方法具有普适性,适用于非线性映射和任意先验分布,但我们证明对于具有高斯先验的线性反问题,群体信息先验能通过后验协方差逆矩阵的行列式和迹来度量信息增益的提升。我们还证明Kullback-Leibler散度通常以高概率得到改善。数值实验结果(包括线性高斯示例和受增材制造资产数字孪生启发的案例)表明,使用这些群体信息先验具有显著价值。