Complexity-calibrated Benchmarks for Machine Learning Reveal When Next-Generation Reservoir Computer Predictions Succeed and Mislead

Recurrent neural networks are used to forecast time series in finance, climate, language, and many other domains. Reservoir computers are a particularly easy-to-train form of recurrent neural network. Recently, a "next-generation" reservoir computer was introduced in which the memory trace involves only a finite number of previous symbols. We explore the inherent limitations of finite-past memory traces in this intriguing proposal. A lower bound from Fano's inequality shows that, on highly non-Markovian processes generated by large probabilistic state machines, next-generation reservoir computers with reasonably long memory traces have an error probability in predicting the next observation that is at least roughly 60% higher than the minimal attainable error probability. More generally, it appears that popular recurrent neural networks fall far short of optimally predicting such complex processes. These results highlight the need for a new generation of optimized recurrent neural network architectures. Alongside this finding, we present concentration-of-measure results for randomly generated but complex processes. One conclusion is that large probabilistic state machines -- specifically, large $\epsilon$-machines -- are key to generating challenging and structurally unbiased stimuli for ground-truthing recurrent neural network architectures.

