Learning Causal Abstractions of Linear Structural Causal Models

The need for modelling causal knowledge at different levels of granularity arises in several settings. Causal Abstraction provides a framework for formalizing this problem by relating two Structural Causal Models at different levels of detail. Despite increasing interest in applying causal abstraction, e.g. in the interpretability of large machine learning models, the graphical and parametrical conditions under which a causal model can abstract another are not known. Furthermore, learning causal abstractions from data is still an open problem. In this work, we tackle both issues for linear causal models with linear abstraction functions. First, we characterize how the low-level coefficients and the abstraction function determine the high-level coefficients and how the high-level model constrains the causal ordering of low-level variables. Then, we apply our theoretical results to learn high-level and low-level causal models and their abstraction function from observational data. In particular, we introduce Abs-LiNGAM, a method that leverages the constraints induced by the learned high-level model and the abstraction function to speedup the recovery of the larger low-level model, under the assumption of non-Gaussian noise terms. In simulated settings, we show the effectiveness of learning causal abstractions from data and the potential of our method in improving scalability of causal discovery.

翻译：在多种场景下，需要以不同粒度层次对因果知识进行建模。因果抽象通过关联两个不同详细程度的结构因果模型，为此问题提供了形式化框架。尽管因果抽象的应用日益受到关注（例如在大型机器学习模型的可解释性领域），但因果模型能够抽象另一模型所需的图结构和参数条件尚不明确。此外，从数据中学习因果抽象仍是一个开放性问题。本研究针对线性因果模型与线性抽象函数同时探讨了这两个问题。首先，我们刻画了底层系数与抽象函数如何决定高层系数，以及高层模型如何约束底层变量的因果序。随后，我们将理论结果应用于从观测数据中学习高层与底层因果模型及其抽象函数。特别地，我们提出了Abs-LiNGAM方法，该方法在非高斯噪声项的假设下，利用已学习的高层模型和抽象函数所诱导的约束，加速对更大规模底层模型的恢复。在模拟环境中，我们展示了从数据中学习因果抽象的有效性，以及该方法在提升因果发现可扩展性方面的潜力。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日