Conventional supervised learning methods typically assume i.i.d samples and are found to be sensitive to out-of-distribution (OOD) data. We propose Generative Causal Representation Learning (GCRL) which leverages causality to facilitate knowledge transfer under distribution shifts. While we evaluate the effectiveness of our proposed method in human trajectory prediction models, GCRL can be applied to other domains as well. First, we propose a novel causal model that explains the generative factors in motion forecasting datasets using features that are common across all environments and with features that are specific to each environment. Selection variables are used to determine which parts of the model can be directly transferred to a new environment without fine-tuning. Second, we propose an end-to-end variational learning paradigm to learn the causal mechanisms that generate observations from features. GCRL is supported by strong theoretical results that imply identifiability of the causal model under certain assumptions. Experimental results on synthetic and real-world motion forecasting datasets show the robustness and effectiveness of our proposed method for knowledge transfer under zero-shot and low-shot settings by substantially outperforming the prior motion forecasting models on out-of-distribution prediction.
翻译:传统监督学习方法通常假设样本独立同分布,但被发现在面对分布外数据时表现敏感。我们提出生成式因果表征学习,其利用因果关系促进分布偏移下的知识迁移。虽然我们在人类轨迹预测模型中评估了所提方法的有效性,但该方法也可应用于其他领域。首先,我们提出一种新颖的因果模型,该模型利用所有环境共有的特征及特定于每个环境的特征,解释运动预测数据集中的生成因子。通过选择变量决定模型哪些部分可直接迁移到新环境而无需微调。其次,我们提出端到端的变分学习范式,以学习从特征生成观测数据的因果机制。生成式因果表征学习得到强理论结果支持,表明在特定假设下因果模型的可辨识性。在合成数据集和真实运动预测数据集上的实验结果表明,所提方法在零样本和少样本场景下通过显著超越先前的运动预测模型,在分布外预测中展现出鲁棒性和有效性。