We analyze connections between two low rank modeling approaches from the last decade for treating dynamical data. The first one is the coherence problem (or coherent set approach), where groups of states are sought that evolve under the action of a stochastic transition matrix in a way maximally distinguishable from other groups. The second one is a low rank factorization approach for stochastic matrices, called Direct Bayesian Model Reduction (DBMR), which estimates the low rank factors directly from observed data. We show that DBMR results in a low rank model that is a projection of the full model, and exploit this insight to infer bounds on a quantitative measure of coherence within the reduced model. Both approaches can be formulated as optimization problems, and we also prove a bound between their respective objectives. On a broader scope, this work relates the two classical loss functions of nonnegative matrix factorization, namely the Frobenius norm and the generalized Kullback--Leibler divergence, and suggests new links between likelihood-based and projection-based estimation of probabilistic models.
翻译:我们分析了近十年来处理动力学数据的两种低秩建模方法之间的联系。第一种方法是一致性问题(或称一致性集合方法),其中寻找在随机转移矩阵作用下以最大程度上区别于其他组的方式进行演化的状态群组。第二种方法是针对随机矩阵的低秩分解方法,称为直接贝叶斯模型降阶(DBMR),该方法直接从观测数据中估计低秩因子。我们证明DBMR得到的低秩模型是全模型的投影,并利用这一见解推导出降阶模型中一致性定量测度的界限。这两种方法均可表述为优化问题,我们还证明了它们各自目标函数之间的界限。从更广泛的视角来看,本研究建立了非负矩阵分解的两种经典损失函数——Frobenius范数和广义Kullback-Leibler散度——之间的联系,并揭示了基于似然和基于投影的概率模型估计之间的新关联。