Multi-Patch Prediction: Adapting LLMs for Time Series Representation Learning

In this study, we present aLLM4TS, a framework that adapts Large Language Models (LLMs) for time-series representation learning. Central to our approach is reconceiving time-series forecasting as a self-supervised, multi-patch prediction task, which captures temporal dynamics in patch representations more effectively than traditional mask-and-reconstruction methods. Our strategy comprises two training stages: (i) a causal continual pre-training phase on various time-series datasets, anchored on next-patch prediction, which aligns LLM capabilities with the characteristics of time-series data; and (ii) fine-tuning for multi-patch prediction in the targeted time-series context. A distinctive element of our framework is the patch-wise decoding layer, which departs from previous methods that rely on sequence-level decoding. This design decodes each patch into its temporal sequence independently, which strengthens the model's ability to learn temporal patch-based representations. aLLM4TS demonstrates superior performance on several downstream tasks, showing that it derives temporal representations with enhanced transferability and marking an advance in the adaptation of LLMs for time-series analysis.
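The patching and next-patch prediction setup described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the non-overlapping stride, and the example sizes are assumptions chosen for illustration, and the parameter comparison assumes plain linear heads without bias terms.

```python
from typing import List, Tuple

Patch = List[float]


def make_patches(series: List[float], patch_len: int, stride: int) -> List[Patch]:
    """Split a univariate series into fixed-length patches (hypothetical helper)."""
    return [series[i:i + patch_len]
            for i in range(0, len(series) - patch_len + 1, stride)]


def next_patch_pairs(patches: List[Patch]) -> List[Tuple[Patch, Patch]]:
    """Pair each patch with its successor.

    With a causal model, the output at position t is trained to match
    patch t + 1, so no masking or reconstruction is involved.
    """
    return list(zip(patches[:-1], patches[1:]))


def decoder_param_counts(num_patches: int, d_model: int,
                         patch_len: int, horizon: int) -> Tuple[int, int]:
    """Weight counts for two assumed linear decoding heads (bias ignored).

    sequence-level: flatten all patch embeddings, map to the full horizon.
    patch-wise:     one shared map from a single embedding to one patch.
    """
    sequence_level = num_patches * d_model * horizon
    patch_wise = d_model * patch_len
    return sequence_level, patch_wise


if __name__ == "__main__":
    series = [float(v) for v in range(12)]
    patches = make_patches(series, patch_len=4, stride=4)
    for context, target in next_patch_pairs(patches):
        print(context, "->", target)

    seq, pw = decoder_param_counts(num_patches=64, d_model=768,
                                   patch_len=16, horizon=720)
    print("sequence-level weights:", seq)
    print("patch-wise weights:", pw)
```

Under these assumed sizes, the shared patch-wise head is far smaller than a flattened sequence-level head, because its size does not grow with the number of patches or the forecast horizon.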

