We investigate the approximation capabilities of dense neural networks. While universal approximation theorems establish that sufficiently large architectures can approximate arbitrary continuous functions when the weight values are unrestricted, we show that dense neural networks do not possess this universality. Our argument is based on a model compression approach that combines the weak regularity lemma with an interpretation of feedforward networks as message-passing graph neural networks. We consider ReLU neural networks subject to natural constraints on the weights and on the input and output dimensions, which together model a notion of dense connectivity. In this setting, we demonstrate the existence of Lipschitz continuous functions that cannot be approximated by such networks. This highlights an intrinsic limitation of neural networks with dense layers and motivates sparse connectivity as a necessary ingredient for achieving true universality.
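To fix ideas, one natural way to formalize such a constrained class is sketched below; the uniform weight bound $C$, the depth $L$, the layer widths $n_\ell$, and the choice of entrywise norm are illustrative assumptions on our part, not necessarily the exact constraints used in the paper.

% Hedged sketch: ReLU networks of arbitrary depth L and widths n_ell,
% with an entrywise weight bound C and fixed input/output dimensions
% n_0 and n_L. The specific norm and the constant C are assumptions
% chosen for illustration.
\[
\mathcal{N}_C \;=\; \Bigl\{\, f = A_L \circ \rho \circ A_{L-1} \circ \cdots \circ \rho \circ A_1 \;:\; A_\ell(x) = W_\ell x + b_\ell,\ \max_{\ell,i,j} \bigl| (W_\ell)_{ij} \bigr| \le C \,\Bigr\},
\]
where $\rho(t) = \max(t,0)$ is applied entrywise and $W_\ell \in \mathbb{R}^{n_\ell \times n_{\ell-1}}$ with $n_0$ and $n_L$ fixed. In this notation, the non-universality claim of the abstract reads: there exist a Lipschitz continuous $f^\ast$ and an $\varepsilon > 0$ such that $\inf_{g \in \mathcal{N}_C} \lVert f^\ast - g \rVert_\infty \ge \varepsilon$ on a compact domain such as $[0,1]^{n_0}$ (another illustrative choice), uniformly over depth and width, in contrast to the unconstrained setting covered by classical universal approximation theorems.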