We propose a continuous optimization framework for discovering a latent directed acyclic graph (DAG) from observational data. Our approach optimizes over the polytope of permutation vectors, the so-called Permutahedron, to learn a topological ordering. Edges can be optimized jointly, or learned conditional on the ordering via a non-differentiable subroutine. Compared to existing continuous optimization approaches our formulation has a number of advantages including: 1. validity: optimizes over exact DAGs as opposed to other relaxations optimizing approximate DAGs; 2. modularity: accommodates any edge-optimization procedure, edge structural parameterization, and optimization loss; 3. end-to-end: either alternately iterates between node-ordering and edge-optimization, or optimizes them jointly. We demonstrate, on real-world data problems in protein-signaling and transcriptional network discovery, that our approach lies on the Pareto frontier of two key metrics, the SID and SHD.
翻译:我们提出了一种连续优化框架,用于从观测数据中发现潜在的有向无环图(DAG)。该方法在置换向量多面体(即Permutahedron)上优化,以学习拓扑排序。边的优化可联合进行,也可通过不可微子程序基于排序条件学习。与现有连续优化方法相比,该公式具有以下优势:1. 有效性:优化精确的DAG而非其他近似DAG的松弛方法;2. 模块化:可适配任意边优化过程、边结构参数化及优化损失函数;3. 端到端性:既可交替迭代节点排序与边优化,也可联合优化。在蛋白质信号传导与转录调控网络发现的真实数据问题中,我们证明该方法在SID和SHD两个关键指标上处于帕累托前沿。