We develop new tools to study landscapes in nonconvex optimization. Given one optimization problem, we pair it with another by smoothly parametrizing the domain. This is either for practical purposes (e.g., to use smooth optimization algorithms with good guarantees) or for theoretical purposes (e.g., to reveal that the landscape satisfies a strict saddle property). In both cases, the central question is: how do the landscapes of the two problems relate? More precisely: how do desirable points such as local minima and critical points in one problem relate to those in the other problem? A key finding in this paper is that these relations are often determined by the parametrization itself, and are almost entirely independent of the cost function. Accordingly, we introduce a general framework to study parametrizations by their effect on landscapes. The framework enables us to obtain new guarantees for an array of problems, some of which were previously treated on a case-by-case basis in the literature. Applications include: optimizing low-rank matrices and tensors through factorizations; solving semidefinite programs via the Burer-Monteiro approach; training neural networks by optimizing their weights and biases; and quotienting out symmetries.
翻译:我们开发了研究非凸优化中景观的新工具。给定一个优化问题,我们通过平滑参数化定义域将其与另一个问题配对。这或出于实用目的(例如,使用具有良好保证的光滑优化算法),或出于理论目的(例如,揭示景观满足严格鞍点性质)。在这两种情况下,核心问题是:两个问题的景观如何关联?更精确地说:一个问题中的理想点(如局部极小值和临界点)如何映射到另一个问题中的对应点?本文的关键发现是,这些关系往往由参数化本身决定,且几乎完全独立于代价函数。据此,我们引入一个通用框架,通过参数化对景观的影响来研究参数化。该框架使我们能够为一系列问题获得新保证,其中部分问题先前在文献中需要逐案处理。应用包括:通过因子分解优化低秩矩阵和张量;通过Burer-Monteiro方法求解半定规划;通过优化权重和偏置训练神经网络;以及通过对称性商化处理。