In this paper, we explore adaptive inference based on variational Bayes. Although several studies have been conducted to analyze the contraction properties of variational posteriors, there is still a lack of a general and computationally tractable variational Bayes method that performs adaptive inference. To fill this gap, we propose a novel adaptive variational Bayes framework, which can operate on a collection of models. The proposed framework first computes a variational posterior over each individual model separately and then combines them with certain weights to produce a variational posterior over the entire model. It turns out that this combined variational posterior is the closest member to the posterior over the entire model in a predefined family of approximating distributions. We show that the adaptive variational Bayes attains optimal contraction rates adaptively under very general conditions. We also provide a methodology to maintain the tractability and adaptive optimality of the adaptive variational Bayes even in the presence of an enormous number of individual models, such as sparse models. We apply the general results to several examples, including deep learning and sparse factor models, and derive new and adaptive inference results. In addition, we characterize an implicit regularization effect of variational Bayes and show that the adaptive variational posterior can utilize this.
翻译:本文探讨基于变分贝叶斯的自适应推断。尽管已有众多研究分析了变分后验的收缩性质,但目前仍缺乏一种兼具通用性与计算可行性的自适应推断变分贝叶斯方法。为填补这一空白,我们提出了一种新颖的自适应变分贝叶斯框架,该框架可作用于多个模型集合。所提框架首先分别计算每个单独模型的变分后验,然后通过特定权重对它们进行组合,从而生成整个模型的变分后验。结果表明,在预定义的近似分布族中,该组合变分后验是与整个模型后验最为接近的成员。我们证明,在非常一般的条件下,自适应变分贝叶斯能够自适应地达到最优收缩率。同时,我们还提供一种方法,使得即便面对海量个体模型(如稀疏模型)时,自适应变分贝叶斯仍能保持其计算可行性与自适应最优性。我们将一般性结果应用于深度学习、稀疏因子模型等多个实例,并推导出全新自适应推断结论。此外,我们刻画了变分贝叶斯的隐式正则化效应,并证明自适应变分后验能够利用这一特性。