Normalization and scaling are fundamental preprocessing steps in time series modeling, yet their role in Transformer-based models remains underexplored from a theoretical perspective. In this work, we present the first formal analysis of how different normalization strategies, specifically instance-based and global scaling, impact the expressivity of Transformer-based architectures for time series representation learning. We propose a novel expressivity framework tailored to time series, which quantifies a model's ability to distinguish between similar and dissimilar inputs in the representation space. Using this framework, we derive theoretical bounds for two widely used normalization methods: Standard and Min-Max scaling. Our analysis reveals that the choice of normalization strategy can significantly influence the model's representational capacity, depending on the task and data characteristics. We complement our theory with empirical validation on classification and forecasting benchmarks using multiple Transformer-based models. Our results show that no single normalization method consistently outperforms others, and in some cases, omitting normalization entirely leads to superior performance. These findings highlight the critical role of preprocessing in time series learning and motivate the need for more principled normalization strategies tailored to specific tasks and datasets.
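To make the distinction concrete, here is a minimal sketch (not from the paper, using NumPy and illustrative function names) of the two scaling methods analyzed, applied either per instance or globally across the dataset:

```python
import numpy as np

def standard_scale(x, axis=None):
    """Standard (z-score) scaling: subtract the mean, divide by the std."""
    mu = x.mean(axis=axis, keepdims=True)
    sigma = x.std(axis=axis, keepdims=True)
    return (x - mu) / (sigma + 1e-8)  # small epsilon guards against zero variance

def minmax_scale(x, axis=None):
    """Min-Max scaling: map values into (approximately) [0, 1]."""
    lo = x.min(axis=axis, keepdims=True)
    hi = x.max(axis=axis, keepdims=True)
    return (x - lo) / (hi - lo + 1e-8)

# Toy batch: 3 univariate series of length 4.
batch = np.array([[ 1.,  2.,  3.,  4.],
                  [10., 20., 30., 40.],
                  [ 0.,  0.,  1.,  1.]])

# Instance-based: statistics computed per series (axis=1),
# so each row is centered/rescaled independently.
instance_std = standard_scale(batch, axis=1)

# Global: a single set of statistics over the whole dataset (axis=None),
# preserving relative magnitudes between series.
global_minmax = minmax_scale(batch, axis=None)
```

Instance-based scaling discards each series' absolute level and scale (which can help forecasting under distribution shift), while global scaling retains cross-series magnitude information; which property aids expressivity depends on the task, mirroring the paper's finding that no single strategy dominates.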