TSMixer: An all-MLP Architecture for Time Series Forecasting

Real-world time-series datasets are often multivariate with complex dynamics. To capture this complexity, high-capacity architectures such as recurrent or attention-based sequential deep learning models have become popular. However, recent work demonstrates that simple univariate linear models can outperform such deep learning models on several commonly used academic benchmarks. Building on this finding, we investigate the capabilities of linear models for time-series forecasting and present Time-Series Mixer (TSMixer), a novel architecture designed by stacking multi-layer perceptrons (MLPs). TSMixer is based on mixing operations along both the time and feature dimensions to extract information efficiently. On popular academic benchmarks, the simple-to-implement TSMixer is comparable to specialized state-of-the-art models that leverage the inductive biases of specific benchmarks. On the challenging, large-scale M5 benchmark, a real-world retail dataset, TSMixer demonstrates superior performance compared to state-of-the-art alternatives. Our results underline the importance of efficiently utilizing cross-variate and auxiliary information for improving the performance of time-series forecasting. We present various analyses to shed light on the capabilities of TSMixer. The design paradigms utilized in TSMixer are expected to open new horizons for deep learning-based time-series forecasting. The implementation is available at https://github.com/google-research/google-research/tree/master/tsmixer
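The idea of mixing along the time and feature dimensions can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation (see the repository linked above for that). The function names (`time_mix`, `feature_mix`, `mixer_block`), the single linear layer per mixing step, the ReLU activation, and the residual connections are all simplifying assumptions made here for illustration. Normalization, dropout, and the final projection to the forecast horizon are omitted.

```python
# Illustrative sketch only: one hypothetical "mixer block" on a window of
# shape L x C (L time steps, C features), stored as a list of rows.

def relu(v):
    """Element-wise ReLU on a list of numbers."""
    return [max(0.0, a) for a in v]

def matvec(w, v):
    """Multiply matrix w (rows x cols) by vector v (length cols)."""
    return [sum(wij * vj for wij, vj in zip(row, v)) for row in w]

def transpose(m):
    """Transpose a list-of-rows matrix."""
    return [list(col) for col in zip(*m)]

def add(a, b):
    """Element-wise sum of two equally shaped matrices (residual connection)."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def time_mix(x, w_time):
    """Apply the same L x L linear map along the time axis of every feature."""
    cols = transpose(x)                                   # C x L
    mixed = [relu(matvec(w_time, col)) for col in cols]   # C x L
    return add(x, transpose(mixed))                       # back to L x C

def feature_mix(x, w_feat):
    """Apply the same C x C linear map along the feature axis of every time step."""
    mixed = [relu(matvec(w_feat, row)) for row in x]      # L x C
    return add(x, mixed)

def mixer_block(x, w_time, w_feat):
    """Time mixing followed by feature mixing (assumed ordering)."""
    return feature_mix(time_mix(x, w_time), w_feat)

def identity(n):
    """n x n identity matrix, used here only as a toy weight matrix."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

# Toy usage: 3 time steps, 2 features, identity weights.
window = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
out = mixer_block(window, identity(3), identity(2))
```

In this sketch the time-mixing weights are shared across all features and the feature-mixing weights are shared across all time steps, which is one way to read "mixing operations along both the time and feature dimensions". In a trained model the weight matrices would be learned rather than fixed.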
