Computing the marginal likelihood (also called the Bayesian model evidence) is an important task in Bayesian model selection, providing a principled quantitative way to compare models. The learned harmonic mean estimator solves the exploding variance problem of the original harmonic mean estimator of the marginal likelihood. It does so by learning an importance sampling target distribution that approximates the optimal distribution. While the approximation need not be highly accurate, it is critical that the probability mass of the learned distribution is contained within the posterior in order to avoid the exploding variance problem. Previous work introduced a bespoke optimization problem when training models to ensure that this property is satisfied. In the current article we introduce the use of normalizing flows to represent the importance sampling target distribution. A flow-based model is trained on samples from the posterior by maximum likelihood estimation. The probability density of the flow is then concentrated by lowering the variance of the base distribution, i.e. by lowering its "temperature", ensuring that its probability mass is contained within the posterior. This approach avoids the need for a bespoke optimization problem and careful fine-tuning of parameters, resulting in a more robust method. Moreover, the use of normalizing flows has the potential to scale to high-dimensional settings. We present preliminary experiments demonstrating the effectiveness of flows for the learned harmonic mean estimator. The publicly available harmonic code, which implements the learned harmonic mean estimator, has been updated to support normalizing flows.
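The procedure described above (fit a flow to posterior samples by maximum likelihood, lower the temperature of the base distribution, then average the ratio of the concentrated density to the unnormalized posterior) can be illustrated with a minimal NumPy sketch. This is not the harmonic package API. As a simplifying assumption, the flow is replaced by a single affine transform of a Gaussian base, whose maximum likelihood fit is just the sample mean and covariance, so that lowering the base variance to T yields the density N(mean, T * cov). The toy Gaussian model, the function names, and the temperature value 0.8 are all hypothetical choices made for illustration; the toy model is chosen because its evidence is known in closed form.

```python
import numpy as np


def log_gauss(x, mean, cov):
    """Log density of a multivariate Gaussian at each row of x."""
    dim = mean.shape[0]
    chol = np.linalg.cholesky(cov)
    sol = np.linalg.solve(chol, (x - mean).T)  # whitened residuals, shape (dim, n)
    maha = np.sum(sol**2, axis=0)
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))
    return -0.5 * (dim * np.log(2.0 * np.pi) + log_det + maha)


def fit_affine_flow(samples):
    """Maximum likelihood fit of the affine stand-in flow x = mean + chol @ z."""
    mean = samples.mean(axis=0)
    cov = np.atleast_2d(np.cov(samples, rowvar=False, bias=True))
    return mean, cov


def log_inverse_evidence(train, infer, log_post_unnorm, temperature=0.8):
    """Learned harmonic mean estimate of log(1/z) with a tempered target."""
    mean, cov = fit_affine_flow(train)
    # Base variance T < 1 concentrates the target: N(mean, T * cov).
    log_phi = log_gauss(infer, mean, temperature * cov)
    log_ratio = log_phi - log_post_unnorm(infer)
    # Numerically stable log of the sample mean of exp(log_ratio).
    shift = log_ratio.max()
    return shift + np.log(np.mean(np.exp(log_ratio - shift)))


def run_toy_example(seed=0, dim=2, n=20000):
    """Toy conjugate Gaussian model (an assumption, not from the article)."""
    rng = np.random.default_rng(seed)
    y = np.full(dim, 0.5)              # single observed data vector
    prior_var, like_var = 4.0, 1.0
    post_var = 1.0 / (1.0 / prior_var + 1.0 / like_var)
    post_mean = y * post_var / like_var
    # Exact posterior draws stand in for MCMC samples.
    samples = post_mean + np.sqrt(post_var) * rng.standard_normal((n, dim))

    def log_post_unnorm(theta):
        log_like = log_gauss(theta, y, like_var * np.eye(dim))
        log_prior = log_gauss(theta, np.zeros(dim), prior_var * np.eye(dim))
        return log_like + log_prior

    # Separate samples for fitting the target and for the estimate itself.
    train, infer = samples[: n // 2], samples[n // 2:]
    log_z_est = -log_inverse_evidence(train, infer, log_post_unnorm)
    # Analytic evidence: y ~ N(0, (prior_var + like_var) * I).
    log_z_true = log_gauss(
        y[None, :], np.zeros(dim), (prior_var + like_var) * np.eye(dim)
    )[0]
    return log_z_est, log_z_true


if __name__ == "__main__":
    est, true = run_toy_example()
    print(f"estimated log evidence: {est:.4f}, analytic: {true:.4f}")
```

Because the tempered target is narrower than the posterior, the ratio being averaged stays bounded, which is what keeps the variance of the estimate under control. Setting the temperature above 1 in this sketch would make the target broader than the posterior and reintroduce the heavy-tailed behaviour of the original harmonic mean estimator.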