Inferring causal structure from data is a challenging task of fundamental importance in science. Observational data are often insufficient to identify a system's causal structure uniquely. While conducting interventions (i.e., experiments) can improve the identifiability, such samples are usually challenging and expensive to obtain. Hence, experimental design approaches for causal discovery aim to minimize the number of interventions by estimating the most informative intervention target. In this work, we propose a novel Gradient-based Intervention Targeting method, abbreviated GIT, that 'trusts' the gradient estimator of a gradient-based causal discovery framework to provide signals for the intervention acquisition function. We provide extensive experiments in simulated and real-world datasets and demonstrate that GIT performs on par with competitive baselines, surpassing them in the low-data regime.
翻译:从数据中推断因果结构是科学中一项具有基础重要性的挑战性任务。观测数据往往不足以唯一确定系统的因果结构。尽管进行干预(即实验)可以提高可辨识性,但此类样本通常难以获取且成本高昂。因此,针对因果发现的实验设计方法旨在通过估计最具信息量的干预目标来最小化干预次数。在本工作中,我们提出了一种新颖的基于梯度的干预目标选择方法(简称GIT),该方法"信任"基于梯度的因果发现框架的梯度估计器,以为干预采集函数提供信号。我们在模拟和真实世界数据集上进行了大量实验,结果表明GIT的性能与竞争基线方法相当,且在低数据场景下优于它们。