We introduce two new particle-based algorithms for learning latent variable models via marginal maximum likelihood estimation, including one which is entirely tuning-free. Our methods are based on the perspective of marginal maximum likelihood estimation as an optimization problem: namely, as the minimization of a free energy functional. One way to solve this problem is to consider the discretization of a gradient flow associated with the free energy. We study one such approach, which resembles an extension of the popular Stein variational gradient descent algorithm. In particular, we establish a descent lemma for this algorithm, which guarantees that the free energy decreases at each iteration. This method, and any other obtained as the discretization of the gradient flow, will necessarily depend on a learning rate which must be carefully tuned by the practitioner in order to ensure convergence at a suitable rate. With this in mind, we also propose another algorithm for optimizing the free energy which is entirely learning rate free, based on coin betting techniques from convex optimization. We validate the performance of our algorithms across a broad range of numerical experiments, including several high-dimensional settings. Our results are competitive with existing particle-based methods, without the need for any hyperparameter tuning.
翻译:我们提出了两种新的基于粒子的算法,用于通过边际最大似然估计学习潜变量模型,其中一种完全无需调参。我们的方法基于边际最大似然估计作为优化问题的视角:即将其视为自由能函数的最小化。求解该问题的一种途径是考虑与自由能相关的梯度流的离散化。我们研究了这样一种方法,它类似于流行的斯坦变分梯度下降算法的扩展。特别地,我们为该算法建立了一个下降引理,保证自由能在每次迭代中递减。该方法以及任何通过梯度流离散化得到的其他方法,必然依赖于学习率,实践者必须仔细调整该参数以确保以合适的速度收敛。基于此,我们提出了另一种优化自由能的算法,该算法基于凸优化中的硬币赌博技术,完全无需学习率。我们通过大量数值实验(包括若干高维场景)验证了算法的性能。我们的结果与现有基于粒子的方法具有竞争力,且无需任何超参数调优。