This work puts forth low-complexity Riemannian subspace descent algorithms for the minimization of functions over the symmetric positive definite (SPD) manifold. Different from the existing Riemannian gradient descent variants, the proposed approach utilizes carefully chosen subspaces that allow the update to be written as a product of the Cholesky factor of the iterate and a sparse matrix. The resulting updates avoid the costly matrix operations like matrix exponentiation and dense matrix multiplication, which are generally required in almost all other Riemannian optimization algorithms on SPD manifold. We further identify a broad class of functions, arising in diverse applications, such as kernel matrix learning, covariance estimation of Gaussian distributions, maximum likelihood parameter estimation of elliptically contoured distributions, and parameter estimation in Gaussian mixture model problems, over which the Riemannian gradients can be calculated efficiently. The proposed uni-directional and multi-directional Riemannian subspace descent variants incur per-iteration complexities of $\mathcal{O}(n)$ and $\mathcal{O}(n^2)$ respectively, as compared to the $\mathcal{O}(n^3)$ or higher complexity incurred by all existing Riemannian gradient descent variants. The superior runtime and low per-iteration complexity of the proposed algorithms is also demonstrated via numerical tests on large-scale covariance estimation problems.
翻译:本文提出了一种低复杂度的黎曼子空间下降算法,用于求解对称正定流形上的函数最小化问题。与现有黎曼梯度下降变体不同,所提方法利用精心选择的子空间,使得更新步骤可表示为迭代变量的Cholesky因子与稀疏矩阵的乘积。该更新过程避免了几乎所有其他对称正定流形上的黎曼优化算法所需的矩阵指数运算和稠密矩阵乘法等高成本矩阵操作。我们进一步识别了一类广泛应用的函数——包括核矩阵学习、高斯分布协方差估计、椭圆等高分布的最大似然参数估计以及高斯混合模型中的参数估计——在这些问题上可高效计算黎曼梯度。所提出的单向和多向黎曼子空间下降变体每次迭代复杂度分别为$\mathcal{O}(n)$和$\mathcal{O}(n^2)$,而现有所有黎曼梯度下降变体则需$\mathcal{O}(n^3)$或更高复杂度。大规模协方差估计问题的数值实验也验证了所提算法在运行速度和每次迭代低复杂度上的优越性。