Orthogonal matrices play an important role in probability and statistics, particularly in high-dimensional statistical models. Parameterizing these models using orthogonal matrices facilitates dimension reduction and parameter identification. However, establishing the theoretical validity of statistical inference in these models from a frequentist perspective is challenging, leading to a preference for Bayesian approaches because of their ability to offer consistent uncertainty quantification. Markov chain Monte Carlo methods are commonly used for numerical approximation of posterior distributions, and sampling on the Stiefel manifold, which comprises orthogonal matrices, poses significant difficulties. While various strategies have been proposed for this purpose, gradient-based Markov chain Monte Carlo with parameterizations is the most efficient. However, a comprehensive comparison of these parameterizations is lacking in the existing literature. This study aims to address this gap by evaluating numerical efficiency of the four alternative parameterizations of orthogonal matrices under equivalent conditions. The evaluation was conducted for four problems. The results suggest that polar expansion parameterization is the most efficient, particularly for the high-dimensional and complex problems. However, all parameterizations exhibit limitations in significantly high-dimensional or difficult tasks, emphasizing the need for further advancements in sampling methods for orthogonal matrices.
翻译:正交矩阵在概率与统计学中扮演重要角色,特别是在高维统计模型中。利用正交矩阵对这些模型进行参数化有助于降维和参数识别。然而,从频率学派视角建立这些模型统计推断的理论有效性具有挑战性,因此贝叶斯方法因其能够提供一致的量化不确定性而受到青睐。马尔可夫链蒙特卡洛方法常用于后验分布的数值逼近,而在由正交矩阵构成的Stiefel流形上进行采样具有显著难度。尽管已有多种策略用于此目的,但基于梯度的参数化马尔可夫链蒙特卡洛方法最为高效。然而,现有文献缺乏对这些参数化方法的系统比较。本研究旨在通过评估四种正交矩阵替代参数化方法在等价条件下的数值效率来填补这一空白。评估涵盖四个问题,结果表明极分解参数化方法最为高效,尤其适用于高维复杂问题。但在维度极高或任务极难的情况下,所有参数化方法均显现局限性,这凸显了需进一步推进正交矩阵采样方法的研究。