How Can Large Language Models Understand Spatial-Temporal Data?

While Large Language Models (LLMs) dominate tasks like natural language processing and computer vision, harnessing their power for spatial-temporal forecasting remains challenging. The disparity between sequential text and complex spatial-temporal data hinders this application. To address this issue, this paper introduces STG-LLM, an innovative approach empowering LLMs for spatial-temporal forecasting. We tackle the data mismatch by proposing: 1) STG-Tokenizer: This spatial-temporal graph tokenizer transforms intricate graph data into concise tokens capturing both spatial and temporal relationships; 2) STG-Adapter: This minimalistic adapter, consisting of linear encoding and decoding layers, bridges the gap between tokenized data and LLM comprehension. By fine-tuning only a small set of parameters, it can effectively grasp the semantics of tokens generated by STG-Tokenizer, while preserving the original natural language understanding capabilities of LLMs. Extensive experiments on diverse spatial-temporal benchmark datasets show that STG-LLM successfully unlocks LLM potential for spatial-temporal forecasting. Remarkably, our approach achieves competitive performance on par with dedicated SOTA methods.

翻译：尽管大型语言模型（LLMs）在自然语言处理和计算机视觉等任务中占据主导地位，但将其能力用于时空预测仍然充满挑战。顺序文本与复杂时空数据之间的差异阻碍了这一应用。为解决这一问题，本文提出了STG-LLM，一种创新的方法，旨在赋能LLMs进行时空预测。我们通过以下方式解决数据不匹配问题：1）STG-Tokenizer：这种时空图分词器将复杂的图数据转化为简洁的标记，同时捕捉空间和时间关系；2）STG-Adapter：这种极简的适配器，由线性编码和解码层组成，弥合了分词数据与LLM理解之间的差距。通过仅微调少量参数，它能有效理解STG-Tokenizer生成的标记语义，同时保留LLMs原有的自然语言理解能力。在多种时空基准数据集上的广泛实验表明，STG-LLM成功释放了LLMs在时空预测方面的潜力。值得注意的是，我们的方法取得了与专用SOTA方法相媲美的竞争性能。

相关内容

大语言模型

关注 66

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日