Rethinking Intelligence: Brain-like Neuron Network

Since their inception, artificial neural networks have relied on manually designed architectures and inductive biases to better adapt to data and tasks. With the rise of deep learning and the expansion of parameter spaces, they have begun to exhibit brain-like functional behaviors. Nevertheless, artificial neural networks remain fundamentally different from biological neural systems in structural organization, learning mechanisms, and evolutionary pathways. From the perspective of neuroscience, we rethink the formation and evolution of intelligence and proposes a new neural network paradigm, Brain-like Neural Network (BNN). We further present the first instantiation of a BNN termed LuminaNet that operates without convolutions or self-attention and is capable of autonomously modifying its architecture. We conduct extensive experiments demonstrating that LuminaNet can achieve self-evolution through dynamic architectural changes. On the CIFAR-10, LuminaNet achieves top-1 accuracy improvements of 11.19%, 5.46% over LeNet-5 and AlexNet, respectively, outperforming MLP-Mixer, ResMLP, and DeiT-Tiny among MLP/ViT architectures. On the TinyStories text generation task, LuminaNet attains a perplexity of 8.4, comparable to a single-layer GPT-2-style Transformer, while reducing computational cost by approximately 25% and peak memory usage by nearly 50%. Code and interactive structures are available at https://github.com/aaroncomo/LuminaNet.

翻译：自诞生以来，人工神经网络一直依赖于人工设计的架构和归纳偏置，以更好地适应数据和任务。随着深度学习的兴起和参数空间的扩展，它们开始展现出类脑的功能行为。然而，人工神经网络在结构组织、学习机制和演化路径上，与生物神经系统仍存在根本性差异。从神经科学的角度出发，我们重新思考智能的形成与演化，并提出一种新的神经网络范式——类脑神经网络（BNN）。我们进一步提出了首个BNN实例，命名为LuminaNet，该网络无需卷积或自注意力机制即可运行，并能够自主修改其架构。我们进行了大量实验，证明LuminaNet能够通过动态架构变化实现自我演化。在CIFAR-10数据集上，LuminaNet的top-1准确率分别比LeNet-5和AlexNet提高了11.19%和5.46%，在MLP/ViT架构中超越了MLP-Mixer、ResMLP和DeiT-Tiny。在TinyStories文本生成任务中，LuminaNet达到了8.4的困惑度，与单层GPT-2风格的Transformer相当，同时计算成本降低了约25%，峰值内存使用量减少了近50%。代码和交互结构可在https://github.com/aaroncomo/LuminaNet获取。

相关内容

神经网络

关注 5918

人工神经网络（Artificial Neural Network，即ANN ），是20世纪80 年代以来人工智能领域兴起的研究热点。它从信息处理角度对人脑神经元网络进行抽象，建立某种简单模型，按不同的连接方式组成不同的网络。在工程与学术界也常直接简称为神经网络或类神经网络。神经网络是一种运算模型，由大量的节点（或称神经元）之间相互联接构成。每个节点代表一种特定的输出函数，称为激励函数（activation function）。每两个节点间的连接都代表一个对于通过该连接信号的加权值，称之为权重，这相当于人工神经网络的记忆。网络的输出则依网络的连接方式，权重值和激励函数的不同而不同。而网络自身通常都是对自然界某种算法或者函数的逼近，也可能是对一种逻辑策略的表达。最近十多年来，人工神经网络的研究工作不断深入，已经取得了很大的进展，其在模式识别、智能机器人、自动控制、预测估计、生物、医学、经济等领域已成功地解决了许多现代计算机难以解决的实际问题，表现出了良好的智能特性。

神经网络如何推理算法？DeepMind Petar等LoG 2022 《神经算法推理》教程，系统性讲解神经网络与经典算法结合

专知会员服务

31+阅读 · 2022年12月22日

【牛津大学博士论文】神经网络中的无监督学习和连续学习，153页pdf

专知会员服务

95+阅读 · 2022年10月5日

【Science Advances】MIT最新论文《特化类脑功能在深度神经网络中自发应用》，人脸识别的优化解决方案

专知会员服务

16+阅读 · 2022年4月10日

DeepMind发69页长文掀开AlphaZero的黑盒：神经网络学到的知识和人类基本相似！

专知会员服务

35+阅读 · 2021年12月7日