Learned harmonic mean estimation of the marginal likelihood with normalizing flows

Computing the marginal likelihood (also called the Bayesian model evidence) is an important task in Bayesian model selection, providing a principled quantitative way to compare models. The learned harmonic mean estimator solves the exploding variance problem of the original harmonic mean estimation of the marginal likelihood. The learned harmonic mean estimator learns an importance sampling target distribution that approximates the optimal distribution. While the approximation need not be highly accurate, it is critical that the probability mass of the learned distribution is contained within the posterior in order to avoid the exploding variance problem. In previous work a bespoke optimization problem is introduced when training models in order to ensure this property is satisfied. In the current article we introduce the use of normalizing flows to represent the importance sampling target distribution. A flow-based model is trained on samples from the posterior by maximum likelihood estimation. Then, the probability density of the flow is concentrated by lowering the variance of the base distribution, i.e. by lowering its "temperature", ensuring its probability mass is contained within the posterior. This approach avoids the need for a bespoke optimisation problem and careful fine tuning of parameters, resulting in a more robust method. Moreover, the use of normalizing flows has the potential to scale to high dimensional settings. We present preliminary experiments demonstrating the effectiveness of the use of flows for the learned harmonic mean estimator. The harmonic code implementing the learned harmonic mean, which is publicly available, has been updated to now support normalizing flows.

翻译：计算边际似然（亦称贝叶斯模型证据）是贝叶斯模型选择中的重要任务，它为模型比较提供了原则性的量化方法。学习调和均值估计器解决了原始调和均值估计在边际似然计算中的方差爆炸问题。该估计器通过学习一个近似最优分布的重要性采样目标分布。虽然该近似无需高度精确，但关键要求是学习到的分布的概率质量必须包含在后验分布内，以避免方差爆炸问题。先前的工作通过引入定制化优化问题来训练模型以满足这一特性。本文提出使用归一化流来表示重要性采样目标分布。基于流的模型通过最大似然估计在后验样本上进行训练。随后，通过降低基分布的方差（即降低其"温度"）来集中流的概率密度，确保其概率质量包含在后验分布内。该方法避免了定制化优化问题和参数精细调优的需求，从而提高了方法的鲁棒性。此外，归一化流的使用具有扩展到高维场景的潜力。我们通过初步实验证明了流方法在学习调和均值估计器中的有效性。目前已公开实现学习调和均值的Harmonic代码已更新，支持归一化流。