Dynamics and Representation Structure of Local Approximations to Gradient-Based Learning in Linear Recurrent Neural Networks

Biological and neuromorphic recurrent neural networks (RNNs) are subject to spatial and temporal locality constraints on the information that can plausibly be used during learning. A common strategy to satisfy these constraints is to modify gradient descent by neglecting non-local terms to varying degrees, as in random feedback local online (RFLO) learning and truncated backpropagation through time (tBPTT). However, the learning dynamics of these algorithms, and how they compare with BPTT, remain poorly understood. We apply dynamical systems theory to data-aligned linear RNNs -- whose dynamics can be separated into orthogonal modes -- to compare stationary solutions, stability properties, and convergence rates, finding qualitatively distinct behaviour for RFLO versus BPTT and one-step tBPTT. We further observe that the solutions learned by RFLO are restricted to low-rank perturbations of initial parameters, a result which holds beyond the data-aligned setting. Our work provides analytical insight into how locality constraints shape learning dynamics, with implications for neuroscientific models of learning and alternative optimization approaches for RNNs.
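The low-rank observation can be illustrated with a small numerical sketch. It is not the paper's setup: it assumes a discrete-time linear RNN h_t = W h_{t-1} + x_t with no leak term, a fixed readout R, a fixed random feedback matrix B, and targets from a hypothetical random teacher map. The names `local_update`, `run_rflo`, and `numerical_rank` are introduced only for illustration. Under these assumptions an RFLO-style update has the form (B e_t) h_{t-1}^T, so the accumulated change in W lies in the column space of B and its rank cannot exceed the number of outputs. Passing R.T in place of B gives the one-step truncated gradient of the instantaneous squared error.

```python
import numpy as np

def local_update(feedback, error, h_prev):
    """Local update direction: outer product of the fed-back error and the previous state."""
    return np.outer(feedback @ error, h_prev)

def run_rflo(n_hidden=20, n_out=2, n_steps=100, lr=1e-3, seed=0):
    """Train W online with an RFLO-style rule (simplified, linear, no leak; an assumption)."""
    rng = np.random.default_rng(seed)
    W0 = 0.5 * rng.standard_normal((n_hidden, n_hidden)) / np.sqrt(n_hidden)
    R = rng.standard_normal((n_out, n_hidden)) / np.sqrt(n_hidden)        # fixed readout
    B = rng.standard_normal((n_hidden, n_out))                            # fixed random feedback
    teacher = rng.standard_normal((n_out, n_hidden)) / np.sqrt(n_hidden)  # hypothetical target map
    W = W0.copy()
    delta = np.zeros_like(W0)   # accumulated change in W, tracked separately
    h = np.zeros(n_hidden)
    for _ in range(n_steps):
        x = 0.1 * rng.standard_normal(n_hidden)
        h_prev = h
        h = W @ h_prev + x                 # linear recurrent dynamics
        error = R @ h - teacher @ x        # instantaneous output error
        dW = -lr * local_update(B, error, h_prev)
        W += dW
        delta += dW
    return W0, W, delta, B

def numerical_rank(M, rel_tol=1e-8):
    """Count singular values above rel_tol times the largest one."""
    s = np.linalg.svd(M, compute_uv=False)
    return int(np.sum(s > rel_tol * s[0])) if s[0] > 0 else 0

W0, W, delta, B = run_rflo()
print("rank of W - W0:", numerical_rank(delta), "out of", W0.shape[0])
```

The explicit relative tolerance in `numerical_rank` is a deliberate choice: rounding error accumulated over many updates can otherwise push tiny singular values above a machine-precision threshold.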


