Neural Persistence is a prominent measure for quantifying neural network complexity, proposed in the emerging field of topological data analysis in deep learning. In this work, however, we find both theoretically and empirically that the variance of network weights and spatial concentration of large weights are the main factors that impact neural persistence. Whilst this captures useful information for linear classifiers, we find that no relevant spatial structure is present in later layers of deep neural networks, making neural persistence roughly equivalent to the variance of weights. Additionally, the proposed averaging procedure across layers for deep neural networks does not consider interaction between layers. Based on our analysis, we propose an extension of the filtration underlying neural persistence to the whole neural network instead of single layers, which is equivalent to calculating neural persistence on one particular matrix. This yields our deep graph persistence measure, which implicitly incorporates persistent paths through the network and alleviates variance-related issues through standardisation. Code is available at https://github.com/ExplainableML/Deep-Graph-Persistence .
翻译:神经持久性是深度学习中新兴拓扑数据分析领域提出的一种衡量神经网络复杂性的重要指标。然而,本工作从理论和实证两方面发现:网络权重的方差和大权重的空间集中度是影响神经持久性的主要因素。虽然这一特性对线性分类器具有价值,但在深层神经网络的后续层中并不存在相关的空间结构,导致神经持久性近似等价于权重方差。此外,针对深层神经网络提出的跨层平均化处理方式并未考虑层间交互作用。基于上述分析,我们提出将神经持久性所依赖的过滤结构从单层扩展至整个神经网络——这等价于在特定矩阵上计算神经持久性。由此产生的深层图持久性度量能隐式整合网络中的持久路径,并通过标准化缓解方差相关问题。代码见https://github.com/ExplainableML/Deep-Graph-Persistence。