This article studies the Fisher-Rao gradient, also referred to as the natural gradient, of the evidence lower bound, the ELBO, which plays a crucial role within the theory of the Variational Autonecoder, the Helmholtz Machine and the Free Energy Principle. The natural gradient of the ELBO is related to the natural gradient of the Kullback-Leibler divergence from a target distribution, the prime objective function of learning. Based on invariance properties of gradients within information geometry, conditions on the underlying model are provided that ensure the equivalence of minimising the prime objective function and the maximisation of the ELBO.
翻译:本文研究了证据下界(ELBO)的Fisher-Rao梯度,也称为自然梯度,该梯度在变分自编码器、亥姆霍兹机器和自由能原理的理论中发挥着关键作用。ELBO的自然梯度与目标分布的Kullback-Leibler散度的自然梯度相关,而后者是学习的主要目标函数。基于信息几何中梯度的不变性性质,本文给出了底层模型应满足的条件,以确保最小化主要目标函数与最大化ELBO的等价性。