Discovering causal structure from purely observational data (i.e., causal discovery), aiming to identify causal relationships among variables, is a fundamental task in machine learning. The recent invention of differentiable score-based DAG learners is a crucial enabler, which reframes the combinatorial optimization problem into a differentiable optimization with a DAG constraint over directed graph space. Despite their great success, these cutting-edge DAG learners incorporate DAG-ness independent score functions to evaluate the directed graph candidates, lacking in considering graph structure. As a result, measuring the data fitness alone regardless of DAG-ness inevitably leads to discovering suboptimal DAGs and model vulnerabilities. Towards this end, we propose a dynamic causal space for DAG structure learning, coined CASPER, that integrates the graph structure into the score function as a new measure in the causal space to faithfully reflect the causal distance between estimated and ground truth DAG. CASPER revises the learning process as well as enhances the DAG structure learning via adaptive attention to DAG-ness. Grounded by empirical visualization, CASPER, as a space, satisfies a series of desired properties, such as structure awareness and noise robustness. Extensive experiments on both synthetic and real-world datasets clearly validate the superiority of our CASPER over the state-of-the-art causal discovery methods in terms of accuracy and robustness.
翻译:从纯观测数据中发现因果结构(即因果发现),旨在识别变量间的因果关系,是机器学习中的基础任务。近期提出的可微分基于评分的有向无环图(DAG)学习器是关键推动因素,它将组合优化问题重构为带有有向图空间上DAG约束的可微优化问题。尽管这些前沿DAG学习器取得了巨大成功,但它们采用与DAG无关的独立评分函数来评估有向图候选,缺乏对图结构的考虑。因此,仅衡量数据拟合度而不考虑DAG特性,不可避免地导致发现次优DAG及模型脆弱性。为此,我们提出用于DAG结构学习的动态因果空间,命名为CASPER,该空间将图结构整合进评分函数,作为因果空间中的新度量以忠实反映估计DAG与真实DAG之间的因果距离。CASPER通过自适应关注DAG特性来修订学习过程并增强DAG结构学习。基于经验可视化的支撑,CASPER作为空间满足一系列期望性质,如结构感知性与噪声鲁棒性。在合成数据集和真实数据集上的大量实验明确验证了我们的CASPER在准确性与鲁棒性上优于最先进的因果发现方法。