TimeDistill：基于跨架构蒸馏的高效长时序预测MLP方法 (TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation) - 专知论文

会员服务 ·

0

蒸馏 · 时序 · 时序预测 · CNN · 知识 ·

TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation

翻译：TimeDistill：基于跨架构蒸馏的高效长时序预测MLP方法

Juntong Ni,Zewen Liu,Shiyu Wang,Ming Jin,Wei Jin

from arxiv, Accepted at KDD 2026, we release our code publicly at https://github.com/LingFengGold/TimeDistill

Transformer-based and CNN-based methods demonstrate strong performance in long-term time series forecasting. However, their high computational and storage requirements can hinder large-scale deployment. To address this limitation, we propose integrating lightweight MLP with advanced architectures using knowledge distillation (KD). Our preliminary study reveals different models can capture complementary patterns, particularly multi-scale and multi-period patterns in the temporal and frequency domains. Based on this observation, we introduce TimeDistill, a cross-architecture KD framework that transfers these patterns from teacher models (e.g., Transformers, CNNs) to MLP. Additionally, we provide a theoretical analysis, demonstrating that our KD approach can be interpreted as a specialized form of mixup data augmentation. TimeDistill improves MLP performance by up to 18.6%, surpassing teacher models on eight datasets. It also achieves up to 7X faster inference and requires 130X fewer parameters. Furthermore, we conduct extensive evaluations to highlight the versatility and effectiveness of TimeDistill.

翻译：基于Transformer和CNN的方法在长时序预测中展现出优异性能，但其较高的计算与存储需求制约了大规模部署。为突破此限制，我们提出通过知识蒸馏（KD）将轻量级MLP与先进架构相融合。初步研究表明，不同模型能捕捉互补的时序模式，特别是在时域与频域中的多尺度与多周期模式。基于此发现，我们提出跨架构蒸馏框架TimeDistill，将教师模型（如Transformer、CNN）中的模式知识迁移至MLP。此外，我们通过理论分析证明该蒸馏方法可视为混合数据增强的特殊形式。TimeDistill将MLP性能提升最高达18.6%，在八个数据集上超越教师模型，同时实现最高7倍的推理加速与130倍的参数缩减。大量实验进一步验证了TimeDistill的通用性与有效性。

0

相关内容

决策智能中的时间序列预测大模型

决策智能中的时间序列预测大模型

专知会员服务

34+阅读 · 1月7日

基于大语言模型的时序知识图谱推理模型蒸馏方法

基于大语言模型的时序知识图谱推理模型蒸馏方法

专知会员服务

36+阅读 · 2025年1月10日

深度学习和基础模型在时间序列预测中的综述

深度学习和基础模型在时间序列预测中的综述

专知会员服务

50+阅读 · 2024年1月26日

时序挖掘如何预训练？华南理工最新《时间序列预训练模型》综述，29页pdf详述时序预训练方法体系

时序挖掘如何预训练？华南理工最新《时间序列预训练模型》综述，29页pdf详述时序预训练方法体系

专知会员服务

85+阅读 · 2023年5月22日

【AI+金融】《将深度神经网络应用于金融时序预测》斯坦福

【AI+金融】《将深度神经网络应用于金融时序预测》斯坦福

专知会员服务

63+阅读 · 2022年4月27日

【AAAI2021最佳论文】基于高效 Transformer 的长时间序列预测

【AAAI2021最佳论文】基于高效 Transformer 的长时间序列预测

专知会员服务

62+阅读 · 2021年2月6日

【牛津大学】深度学习时间序列预测，12页pdf, Deep Learning Time Series Forecasting

【牛津大学】深度学习时间序列预测，12页pdf, Deep Learning Time Series Forecasting

专知会员服务

174+阅读 · 2020年5月1日

【牛津大学】深度学习时间序列预测，Time Series Forecasting With Deep Learning: A Survey

【牛津大学】深度学习时间序列预测，Time Series Forecasting With Deep Learning: A Survey

专知会员服务

142+阅读 · 2020年4月30日

金融时序预测中的深度学习方法：2005到2019

金融时序预测中的深度学习方法：2005到2019

专知会员服务

168+阅读 · 2019年12月4日

【ECML-PKDD 2019】多维时间序列和事件日志的模式挖掘和异常检测框架（A framework for pattern mining and anomalydetection in multi-dimensional time series andevent logs）

【ECML-PKDD 2019】多维时间序列和事件日志的模式挖掘和异常检测框架（A framework for pattern mining and anomalydetection in multi-dimensional time series andevent logs）

专知会员服务

38+阅读 · 2019年12月1日

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

专知

18+阅读 · 2022年4月9日

【CMU-Amazon】时间序列预测：理论与实践，379页ppt阐述大规模时序预测工具与方法

【CMU-Amazon】时间序列预测：理论与实践，379页ppt阐述大规模时序预测工具与方法

专知

31+阅读 · 2020年4月24日

图卷积神经网络蒸馏知识，Distillating Knowledge from GCN

图卷积神经网络蒸馏知识，Distillating Knowledge from GCN

专知

41+阅读 · 2020年3月25日

【MIT-伯克利-ICLR2020】对比表示蒸馏，Contrastive Representation Distillation

【MIT-伯克利-ICLR2020】对比表示蒸馏，Contrastive Representation Distillation

专知

54+阅读 · 2020年3月12日

金融时序预测中的深度学习方法综述: 从2005到2019，附63页pdf下载

金融时序预测中的深度学习方法综述: 从2005到2019，附63页pdf下载

专知

70+阅读 · 2019年12月4日

基于LSTM深层神经网络的时间序列预测

基于LSTM深层神经网络的时间序列预测

论智

22+阅读 · 2018年9月4日

基于 Keras 用深度学习预测时间序列

基于 Keras 用深度学习预测时间序列

R语言中文社区

23+阅读 · 2018年7月27日

教你搭建多变量时间序列预测模型LSTM（附代码、数据集）

教你搭建多变量时间序列预测模型LSTM（附代码、数据集）

数据派THU

59+阅读 · 2017年11月6日

教程 | 基于Keras的LSTM多变量时间序列预测

教程 | 基于Keras的LSTM多变量时间序列预测

机器之心

20+阅读 · 2017年10月30日

如何在Python中用LSTM网络进行时间序列预测

如何在Python中用LSTM网络进行时间序列预测

AI100

17+阅读 · 2017年8月5日

基于时变回声状态网的光伏发电在线短期预测方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于相空间挤压策略的空间信号时频分析与参数估计方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

新型快速高稳定性时域积分方程算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于深度卷积神经网络的多源遥感图像时空融合方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于连续时间PWA模型的混杂系统预测控制研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于Memetic多目标时变优化的全基因代谢网络重构算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向安全关键系统的时间可预测多核代码生成方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

高维时间序列的降维与建模

国家自然科学基金

23+阅读 · 2015年12月31日

面向大数据的高时效并行计算机系统结构与技术

国家自然科学基金

0+阅读 · 2014年12月31日

超线性增长条件下的混杂型随机时滞微分方程

国家自然科学基金

0+阅读 · 2014年12月31日

T-LLM: Teaching Large Language Models to Forecast Time Series via Temporal Distillation

Arxiv

0+阅读 · 2月2日

AverageTime: Enhance Long-Term Time Series Forecasting with Simple Averaging

Arxiv

0+阅读 · 1月31日

Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

Arxiv

0+阅读 · 1月29日

MoHETS: Long-term Time Series Forecasting with Mixture-of-Heterogeneous-Experts

Arxiv

0+阅读 · 1月29日

PatchFormer: A Patch-Based Time Series Foundation Model with Hierarchical Masked Reconstruction and Cross-Domain Transfer Learning for Zero-Shot Multi-Horizon Forecasting

Arxiv

0+阅读 · 1月28日

TimeCatcher: A Variational Framework for Volatility-Aware Forecasting of Non-Stationary Time Series

Arxiv

0+阅读 · 1月28日

ScatterFusion: A Hierarchical Scattering Transform Framework for Enhanced Time Series Forecasting

Arxiv

0+阅读 · 1月28日

FaST: Efficient and Effective Long-Horizon Forecasting for Large-Scale Spatial-Temporal Graphs via Mixture-of-Experts

Arxiv

0+阅读 · 1月8日

TimeMosaic: Temporal Heterogeneity Guided Time Series Forecasting via Adaptive Granularity Patch and Segment-wise Decoding

Arxiv

0+阅读 · 1月5日

Sequential Reservoir Computing for Efficient High-Dimensional Spatiotemporal Forecasting

Arxiv

0+阅读 · 1月1日

VIP会员

文章信息

相关主题

相关VIP内容

决策智能中的时间序列预测大模型

决策智能中的时间序列预测大模型

专知会员服务

34+阅读 · 1月7日

基于大语言模型的时序知识图谱推理模型蒸馏方法

基于大语言模型的时序知识图谱推理模型蒸馏方法

专知会员服务

36+阅读 · 2025年1月10日

深度学习和基础模型在时间序列预测中的综述

深度学习和基础模型在时间序列预测中的综述

专知会员服务

50+阅读 · 2024年1月26日

时序挖掘如何预训练？华南理工最新《时间序列预训练模型》综述，29页pdf详述时序预训练方法体系

时序挖掘如何预训练？华南理工最新《时间序列预训练模型》综述，29页pdf详述时序预训练方法体系

专知会员服务

85+阅读 · 2023年5月22日

【AI+金融】《将深度神经网络应用于金融时序预测》斯坦福

【AI+金融】《将深度神经网络应用于金融时序预测》斯坦福

专知会员服务

63+阅读 · 2022年4月27日

【AAAI2021最佳论文】基于高效 Transformer 的长时间序列预测

【AAAI2021最佳论文】基于高效 Transformer 的长时间序列预测

专知会员服务

62+阅读 · 2021年2月6日

【牛津大学】深度学习时间序列预测，12页pdf, Deep Learning Time Series Forecasting

【牛津大学】深度学习时间序列预测，12页pdf, Deep Learning Time Series Forecasting

专知会员服务

174+阅读 · 2020年5月1日

【牛津大学】深度学习时间序列预测，Time Series Forecasting With Deep Learning: A Survey

【牛津大学】深度学习时间序列预测，Time Series Forecasting With Deep Learning: A Survey

专知会员服务

142+阅读 · 2020年4月30日

金融时序预测中的深度学习方法：2005到2019

金融时序预测中的深度学习方法：2005到2019

专知会员服务

168+阅读 · 2019年12月4日

【ECML-PKDD 2019】多维时间序列和事件日志的模式挖掘和异常检测框架（A framework for pattern mining and anomalydetection in multi-dimensional time series andevent logs）

【ECML-PKDD 2019】多维时间序列和事件日志的模式挖掘和异常检测框架（A framework for pattern mining and anomalydetection in multi-dimensional time series andevent logs）

专知会员服务

38+阅读 · 2019年12月1日

热门VIP内容

开通专知VIP会员享更多权益服务

智能体记忆深度剖析：评价指标与系统局限性的分类体系及实证分析

《可信人工智能赋能系统的支柱》

【CMU博士论文】可靠轨迹预测的分层基石：数据、评估与方法

人工智能赋能边缘与自主系统：美陆军现代化进程聚焦威胁探测与战术边缘情报

相关资讯

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

专知

18+阅读 · 2022年4月9日

【CMU-Amazon】时间序列预测：理论与实践，379页ppt阐述大规模时序预测工具与方法

【CMU-Amazon】时间序列预测：理论与实践，379页ppt阐述大规模时序预测工具与方法

专知

31+阅读 · 2020年4月24日

图卷积神经网络蒸馏知识，Distillating Knowledge from GCN

图卷积神经网络蒸馏知识，Distillating Knowledge from GCN

专知

41+阅读 · 2020年3月25日

【MIT-伯克利-ICLR2020】对比表示蒸馏，Contrastive Representation Distillation

【MIT-伯克利-ICLR2020】对比表示蒸馏，Contrastive Representation Distillation

专知

54+阅读 · 2020年3月12日

金融时序预测中的深度学习方法综述: 从2005到2019，附63页pdf下载

金融时序预测中的深度学习方法综述: 从2005到2019，附63页pdf下载

专知

70+阅读 · 2019年12月4日

基于LSTM深层神经网络的时间序列预测

基于LSTM深层神经网络的时间序列预测

论智

22+阅读 · 2018年9月4日

基于 Keras 用深度学习预测时间序列

基于 Keras 用深度学习预测时间序列

R语言中文社区

23+阅读 · 2018年7月27日

教你搭建多变量时间序列预测模型LSTM（附代码、数据集）

教你搭建多变量时间序列预测模型LSTM（附代码、数据集）

数据派THU

59+阅读 · 2017年11月6日

教程 | 基于Keras的LSTM多变量时间序列预测

教程 | 基于Keras的LSTM多变量时间序列预测

机器之心

20+阅读 · 2017年10月30日

如何在Python中用LSTM网络进行时间序列预测

如何在Python中用LSTM网络进行时间序列预测

AI100

17+阅读 · 2017年8月5日

相关论文

T-LLM: Teaching Large Language Models to Forecast Time Series via Temporal Distillation

Arxiv

0+阅读 · 2月2日

AverageTime: Enhance Long-Term Time Series Forecasting with Simple Averaging

Arxiv

0+阅读 · 1月31日

Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

Arxiv

0+阅读 · 1月29日

MoHETS: Long-term Time Series Forecasting with Mixture-of-Heterogeneous-Experts

Arxiv

0+阅读 · 1月29日

PatchFormer: A Patch-Based Time Series Foundation Model with Hierarchical Masked Reconstruction and Cross-Domain Transfer Learning for Zero-Shot Multi-Horizon Forecasting

Arxiv

0+阅读 · 1月28日

TimeCatcher: A Variational Framework for Volatility-Aware Forecasting of Non-Stationary Time Series

Arxiv

0+阅读 · 1月28日

ScatterFusion: A Hierarchical Scattering Transform Framework for Enhanced Time Series Forecasting

Arxiv

0+阅读 · 1月28日

FaST: Efficient and Effective Long-Horizon Forecasting for Large-Scale Spatial-Temporal Graphs via Mixture-of-Experts

Arxiv

0+阅读 · 1月8日

TimeMosaic: Temporal Heterogeneity Guided Time Series Forecasting via Adaptive Granularity Patch and Segment-wise Decoding

Arxiv

0+阅读 · 1月5日

Sequential Reservoir Computing for Efficient High-Dimensional Spatiotemporal Forecasting

Arxiv

0+阅读 · 1月1日

相关基金

基于时变回声状态网的光伏发电在线短期预测方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于相空间挤压策略的空间信号时频分析与参数估计方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

新型快速高稳定性时域积分方程算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于深度卷积神经网络的多源遥感图像时空融合方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于连续时间PWA模型的混杂系统预测控制研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于Memetic多目标时变优化的全基因代谢网络重构算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向安全关键系统的时间可预测多核代码生成方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

高维时间序列的降维与建模

国家自然科学基金

23+阅读 · 2015年12月31日

面向大数据的高时效并行计算机系统结构与技术

国家自然科学基金

0+阅读 · 2014年12月31日

超线性增长条件下的混杂型随机时滞微分方程

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员