Sequence Learning using Equilibrium Propagation

Equilibrium Propagation (EP) is a powerful and more bio-plausible alternative to conventional learning frameworks such as backpropagation. The effectiveness of EP stems from the fact that it relies only on local computations and requires solely one kind of computational unit during both of its training phases, thereby enabling greater applicability in domains such as bio-inspired neuromorphic computing. The dynamics of the model in EP is governed by an energy function and the internal states of the model consequently converge to a steady state following the state transition rules defined by the same. However, by definition, EP requires the input to the model (a convergent RNN) to be static in both the phases of training. Thus it is not possible to design a model for sequence classification using EP with an LSTM or GRU like architecture. In this paper, we leverage recent developments in modern hopfield networks to further understand energy based models and develop solutions for complex sequence classification tasks using EP while satisfying its convergence criteria and maintaining its theoretical similarities with recurrent backpropagation. We explore the possibility of integrating modern hopfield networks as an attention mechanism with convergent RNN models used in EP, thereby extending its applicability for the first time on two different sequence classification tasks in natural language processing viz. sentiment analysis (IMDB dataset) and natural language inference (SNLI dataset).

翻译：平衡传播（EP）是传统学习框架（如反向传播）的一种更强大且更具生物合理性的替代方案。EP的有效性源于其仅依赖局部计算，并且在两个训练阶段中只需使用同一种计算单元，从而在生物启发式神经形态计算等领域具有更强的适用性。EP中模型的动态由能量函数控制，模型内部状态根据该能量函数定义的状态转移规则收敛至稳态。然而，依据定义，EP要求模型（一个收敛的RNN）的输入在两个训练阶段中均保持静态。因此，无法使用EP设计基于LSTM或GRU架构的序列分类模型。本文利用现代Hopfield网络的最新进展，进一步理解基于能量的模型，并为使用EP解决复杂序列分类任务开发解决方案，同时满足其收敛条件并保持与循环反向传播的理论相似性。我们探索将现代Hopfield网络作为注意力机制与EP中使用的收敛RNN模型相结合的可能性，从而首次将其应用扩展到自然语言处理中的两个不同序列分类任务：情感分析（IMDB数据集）和自然语言推理（SNLI数据集）。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【UMD开放书】机器学习课程书册，19章227页pdf，带你学习ML

专知会员服务

102+阅读 · 2019年12月9日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日