Seeking Next Layer Neurons' Attention for Error-Backpropagation-Like Training in a Multi-Agent Network Framework

Despite considerable theoretical progress in the training of neural networks viewed as a multi-agent system of neurons, particularly concerning biological plausibility and decentralized training, their applicability to real-world problems remains limited due to scalability issues. In contrast, error-backpropagation has demonstrated its effectiveness for training deep networks in practice. In this study, we propose a local objective for neurons that, when pursued by neurons individually, align them to exhibit similarities to error-backpropagation in terms of efficiency and scalability during training. For this purpose, we examine a neural network comprising decentralized, self-interested neurons seeking to maximize their local objective -- attention from subsequent layer neurons -- and identify the optimal strategy for neurons. We also analyze the relationship between this strategy and backpropagation, establishing conditions under which the derived strategy is equivalent to error-backpropagation. Lastly, we demonstrate the learning capacity of these multi-agent neural networks through experiments on three datasets and showcase their superior performance relative to error-backpropagation in a catastrophic forgetting benchmark.

翻译：尽管将神经网络视为神经元的多智能体系统在理论研究上取得了显著进展（尤其是在生物合理性和分散训练方面），但由于可扩展性问题，其在实际问题中的适用性仍然有限。相比之下，误差反向传播在实践中已证明了其在训练深度网络方面的有效性。在本研究中，我们提出了一种局部目标函数，当神经元各自追求该目标时，它们能在训练过程中表现出与误差反向传播相似的效率和可扩展性。为此，我们考察了一个由分散、自利神经元组成的神经网络——这些神经元试图最大化其局部目标（即来自后续层神经元的注意力），并确定了神经元的最优策略。我们还分析了该策略与反向传播之间的关系，建立了所推导策略与误差反向传播等价的条件。最后，我们通过三个数据集上的实验展示了这些多智能体神经网络的学习能力，并在灾难性遗忘基准测试中证明了其相对于误差反向传播的优越性能。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日