Inferring causal structure from data is a challenging task of fundamental importance in science. Observational data are often insufficient to identify a system's causal structure uniquely. While conducting interventions (i.e., experiments) can improve the identifiability, such samples are usually challenging and expensive to obtain. Hence, experimental design approaches for causal discovery aim to minimize the number of interventions by estimating the most informative intervention target. In this work, we propose a novel Gradient-based Intervention Targeting method, abbreviated GIT, that 'trusts' the gradient estimator of a gradient-based causal discovery framework to provide signals for the intervention acquisition function. We provide extensive experiments in simulated and real-world datasets and demonstrate that GIT performs on par with competitive baselines, surpassing them in the low-data regime.
翻译:从数据中推断因果结构是科学中一项具有根本重要性的挑战性任务。观测数据往往不足以唯一确定系统的因果结构。虽然进行干预(即实验)可以提高可辨识性,但此类样本通常难以获取且成本高昂。因此,面向因果发现的实验设计方法旨在通过估计最具信息量的干预目标来最小化干预次数。本研究提出一种名为GIT的新型梯度干预目标选择方法,该方法"信任"基于梯度的因果发现框架中的梯度估计器,为干预采集函数提供信号。我们在模拟数据集和真实世界数据集上进行了大量实验,结果表明GIT与竞争基线方法性能相当,并在低数据场景下超越它们。