Since its use in the Lottery Ticket Hypothesis, iterative magnitude pruning (IMP) has become a popular method for extracting sparse subnetworks that can be trained to high performance. Despite its popularity, why IMP succeeds so broadly remains unclear. One possibility is that IMP is especially capable of extracting and maintaining strong inductive biases. Supporting this, recent work has shown that applying IMP to fully connected neural networks (FCNs) leads to the emergence of local receptive fields (RFs), an architectural feature present in mammalian visual cortex and convolutional neural networks. How IMP achieves this remains an open question. Inspired by results showing that training FCNs on synthetic images with highly non-Gaussian statistics (e.g., sharp edges) is sufficient to drive the formation of local RFs, we hypothesize that IMP iteratively increases the non-Gaussian statistics present in FCN representations, creating a feedback loop that enhances localization. We develop a new method for measuring the effect of individual weights on the statistics of FCN representations (the "cavity method"), which allows us to find evidence supporting this hypothesis. Our work, the first to study the effect of IMP on the statistics of neural-network representations, sheds parsimonious light on one way in which IMP can drive the formation of strong inductive biases.
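For readers unfamiliar with the procedure, one round of IMP as used in the Lottery Ticket literature can be sketched as follows. This is a minimal illustration, not the paper's implementation: `imp_step`, the flat weight lists, and the 20% default pruning fraction are hypothetical simplifications of the standard prune-then-rewind loop.

```python
# Minimal sketch of one iterative-magnitude-pruning (IMP) round:
# after training, the smallest-magnitude surviving weights are masked out,
# and the remaining weights are rewound to their initial values before retraining.
# All names and the flat-list weight representation are illustrative.

def imp_step(init_weights, trained_weights, mask, prune_frac=0.2):
    """Return (new_mask, rewound_weights) after one IMP round."""
    # Magnitudes of weights still alive under the current mask.
    alive = [(abs(w), i) for i, (w, m) in enumerate(zip(trained_weights, mask)) if m]
    alive.sort()  # ascending by magnitude
    n_prune = int(prune_frac * len(alive))
    new_mask = list(mask)
    for _, i in alive[:n_prune]:
        new_mask[i] = 0  # prune the smallest-magnitude weights
    # Lottery-ticket rewinding: survivors restart from their initial values.
    rewound = [w0 if m else 0.0 for w0, m in zip(init_weights, new_mask)]
    return new_mask, rewound
```

Iterating this step (train, prune, rewind, retrain) produces progressively sparser subnetworks; the abstract's hypothesis concerns how each such round reshapes the statistics of the surviving network's representations.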