The $ω$-Condition Number for Optimal Preconditioning and Low Rank Generalized Jacobian Updating

Preconditioning is essential in iterative methods for solving linear systems. It is also the implicit objective in updating approximations of Jacobians in optimization methods, e.g., in quasi-Newton methods. Motivated by the latter, we study a nonclassic matrix condition number, the $\omega$-condition number. We do this in the context of optimal conditioning for: (i) our application to low rank updating of generalized Jacobians; (ii) iterative methods for linear systems: (iia) clustering of eigenvalues and (iib) convergence rates. For a positive definite matrix, the $\omega$-condition measure is the ratio of the arithmetic and geometric means of the eigenvalues. In particular, our applications concentrate on linear systems with low rank updates of ill-conditioned positive definite matrices. These systems arise in the context of nonsmooth Newton methods using generalized Jacobians. We are able to use optimality conditions and derive explicit formulae for $\omega$-optimal preconditioners and preconditioned updates. Connections to partial Cholesky sparse preconditioners are made. Evaluating or estimating the classical condition number $\kappa$ can be expensive. We show that the $\omega$-condition number can be evaluated explicitly following a Cholesky or LU factorization. Moreover, the simplicity of $\omega$ allows for the derivation of formulae for optimal preconditioning in various scenarios, i.e., this avoids the need for expensive algorithmic calculations. Our empirics show that $\omega$ estimates the actual condition of a linear system significantly better. Moreover, our empirical results show a significant decrease in the number of iterations required for a requested accuracy in the residual during an iterative method, i.e., these results confirm the efficacy of using the $\omega$-condition number compared to the classical condition number.

翻译：预处理在求解线性系统的迭代方法中至关重要，它同时也是优化方法（如拟牛顿法）中更新雅可比近似矩阵的隐含目标。受后者启发，我们研究了一种非经典的矩阵条件数——ω-条件数。我们在以下最优条件化的背景下展开研究：(i) 广义雅可比矩阵低秩更新的应用；(ii) 线性系统的迭代方法：(iia) 特征值聚类与(iib) 收敛速率。对于正定矩阵，ω-条件测度是特征值算术平均与几何平均之比。特别地，我们的应用集中于具有病态正定矩阵低秩更新的线性系统。这类系统出现在使用广义雅可比矩阵的非光滑牛顿方法中。我们能够利用最优性条件，推导出ω-最优预处理矩阵及预处理更新的显式公式，并与部分Cholesky稀疏预处理方法建立联系。经典条件数κ的计算或估计可能代价高昂。我们证明ω-条件数可在Cholesky或LU分解后显式计算。此外，ω的简洁性使得能够在多种场景下推导最优预处理的显式公式，从而避免昂贵的算法计算。实验表明，ω能显著更准确地估计线性系统的实际条件。同时，实证结果显示，在迭代方法中为达到残差所需精度所需的迭代次数显著减少，这些结果证实了使用ω-条件数相较于经典条件数的有效性。