Beyond Random Matrix Theory for Deep Networks

We investigate whether the Wigner semi-circle and Marchenko-Pastur distributions, often used in the theoretical analysis of deep neural networks, match empirically observed spectral densities. We find that, even allowing for outliers, the observed spectral shapes deviate strongly from these theoretical predictions. This raises major questions about the usefulness of these models in deep learning. We further show that theoretical results, such as the layered nature of critical points, depend strongly on the exact form of these limiting spectral densities. We consider two new classes of matrix ensembles: products of random Wigner/Wishart ensembles and percolated Wigner/Wishart ensembles, both of which better match observed spectra. They also give large discrete spectral peaks at the origin, providing a theoretical explanation for the observation that various optima can be connected by one-dimensional paths of low loss values. Finally, we show that, in the case of a random matrix product, the weight of the discrete spectral component at $0$ depends on the ratio of the dimensions of the weight matrices.
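The dependence of the spectral mass at $0$ on the dimension ratio can be illustrated with a simple rank argument. The sketch below is not the paper's construction: it assumes a Wishart-type matrix $XX^\top$ with $X = AB$, where $A$ ($n \times k$) and $B$ ($k \times p$) have i.i.d. Gaussian entries, and all function names are hypothetical. Since $XX^\top$ is symmetric and has the same rank as $X$, the fraction of its eigenvalues that are exactly zero is $1 - \operatorname{rank}(X)/n$, which for generic Gaussian factors equals $1 - \min(n, k, p)/n$.

```python
import random


def gaussian_matrix(rows, cols, rng):
    """Matrix with i.i.d. standard normal entries (illustrative ensemble)."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def matmul(a, b):
    """Plain-Python matrix product of two list-of-lists matrices."""
    b_cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in b_cols] for row in a]


def numerical_rank(matrix, tol=1e-9):
    """Rank via Gaussian elimination with partial pivoting.

    Pivots smaller than tol times the largest entry are treated as zero.
    """
    m = [row[:] for row in matrix]
    n_rows, n_cols = len(m), len(m[0])
    scale = max(abs(v) for row in m for v in row) or 1.0
    rank = 0
    for c in range(n_cols):
        if rank == n_rows:
            break
        pivot = max(range(rank, n_rows), key=lambda i: abs(m[i][c]))
        if abs(m[pivot][c]) <= tol * scale:
            continue  # no usable pivot in this column
        m[rank], m[pivot] = m[pivot], m[rank]
        for i in range(rank + 1, n_rows):
            factor = m[i][c] / m[rank][c]
            for j in range(c, n_cols):
                m[i][j] -= factor * m[rank][j]
        rank += 1
    return rank


def zero_mass(n, k, p, seed=0):
    """Fraction of zero eigenvalues of X X^T for X = A B, A: n x k, B: k x p.

    Uses rank(X X^T) = rank(X), so the zero eigenvalue has multiplicity
    n - rank(X) in the symmetric n x n matrix X X^T.
    """
    rng = random.Random(seed)
    x = matmul(gaussian_matrix(n, k, rng), gaussian_matrix(k, p, rng))
    return 1.0 - numerical_rank(x) / n


if __name__ == "__main__":
    n, p = 12, 20
    for k in (4, 6, 16):
        print(f"inner dimension k={k}: zero mass = {zero_mass(n, k, p):.3f}")
```

In this toy setting, shrinking the inner dimension $k$ relative to $n$ increases the point mass at the origin, while $k \ge n$ (with $p \ge n$) removes it. This mirrors the qualitative claim in the abstract but says nothing about the shape of the continuous part of the spectrum.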

