We introduce a gradient-based approach for the problem of Bayesian optimal experimental design to learn causal models in a batch setting -- a critical component for causal discovery from finite data where interventions can be costly or risky. Existing methods rely on greedy approximations to construct a batch of experiments while using black-box methods to optimize over a single target-state pair to intervene with. In this work, we completely dispose of the black-box optimization techniques and greedy heuristics and instead propose a conceptually simple end-to-end gradient-based optimization procedure to acquire a set of optimal intervention target-state pairs. Such a procedure enables parameterization of the design space to efficiently optimize over a batch of multi-target-state interventions, a setting which has hitherto not been explored due to its complexity. We demonstrate that our proposed method outperforms baselines and existing acquisition strategies in both single-target and multi-target settings across a number of synthetic datasets.
翻译:我们提出了一种基于梯度的方法,用于解决批量设置中学习因果模型的贝叶斯最优实验设计问题——这是从有限数据中进行因果发现的关键组成部分,其中干预可能代价高昂或具有风险。现有方法依赖于贪婪近似来构建一组实验,同时使用黑盒方法优化需要干预的单个目标-状态对。在本工作中,我们完全摒弃了黑盒优化技术和贪婪启发式方法,而提出了一种概念上简单的端到端梯度优化过程来获取一组最优的干预目标-状态对。该过程能够对设计空间进行参数化,从而高效优化一组多目标-状态干预——由于复杂度高,这一设置此前尚未被探索。我们证明,所提出的方法在多个合成数据集上的单目标和多目标设置中均优于基线和现有采集策略。