Self-Supervised Deconfounding Against Spatio-Temporal Shifts: Theory and Modeling

As an important application of spatio-temporal (ST) data, ST traffic forecasting plays a crucial role in improving urban travel efficiency and promoting sustainable development. In practice, the dynamics of traffic data frequently undergo distributional shifts attributed to external factors such as time evolution and spatial differences. This entails forecasting models to handle the out-of-distribution (OOD) issue where test data is distributed differently from training data. In this work, we first formalize the problem by constructing a causal graph of past traffic data, future traffic data, and external ST contexts. We reveal that the failure of prior arts in OOD traffic data is due to ST contexts acting as a confounder, i.e., the common cause for past data and future ones. Then, we propose a theoretical solution named Disentangled Contextual Adjustment (DCA) from a causal lens. It differentiates invariant causal correlations against variant spurious ones and deconfounds the effect of ST contexts. On top of that, we devise a Spatio-Temporal sElf-superVised dEconfounding (STEVE) framework. It first encodes traffic data into two disentangled representations for associating invariant and variant ST contexts. Then, we use representative ST contexts from three conceptually different perspectives (i.e., temporal, spatial, and semantic) as self-supervised signals to inject context information into both representations. In this way, we improve the generalization ability of the learned context-oriented representations to OOD ST traffic forecasting. Comprehensive experiments on four large-scale benchmark datasets demonstrate that our STEVE consistently outperforms the state-of-the-art baselines across various ST OOD scenarios.

翻译：作为时空数据的重要应用，时空交通预测在提升城市出行效率与促进可持续发展方面扮演关键角色。在实践中，受时间演变和空间差异等外部因素影响，交通数据的动态分布频繁发生偏移。这要求预测模型能够处理测试数据分布与训练数据不同的分布外（OOD）问题。本研究首先通过构建历史交通数据、未来交通数据及外部时空上下文的因果图形式化该问题。我们揭示出，现有方法在OOD交通数据中失效的原因在于时空上下文扮演了混杂因子角色——即历史数据与未来数据的共同原因。进而，我们从因果视角提出名为解耦上下文调整（DCA）的理论解决方案，通过区分不变因果关联与变化伪关联，消除时空上下文的混淆效应。在此基础上，我们设计了时空自监督去混淆（STEVE）框架。该框架首先将交通数据编码为两种解耦表征，分别关联不变与变化的时空上下文；随后，从时间、空间和语义三个概念性视角选取代表性时空上下文作为自监督信号，将上下文信息注入两种表征。通过这种方式，我们提升了面向OOD时空交通预测的上下文导向表征的泛化能力。在四个大规模基准数据集上的综合实验表明，我们的STEVE框架在各类时空OOD场景中均持续超越当前最优基线方法。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日