Directed acyclic graphs (DAGs) encode a lot of information about a particular distribution in their structure. However, compute required to infer these structures is typically super-exponential in the number of variables, as inference requires a sweep of a combinatorially large space of potential structures. That is, until recent advances made it possible to search this space using a differentiable metric, drastically reducing search time. While this technique -- named NOTEARS -- is widely considered a seminal work in DAG-discovery, it concedes an important property in favour of differentiability: transportability. To be transportable, the structures discovered on one dataset must apply to another dataset from the same domain. We introduce D-Struct which recovers transportability in the discovered structures through a novel architecture and loss function while remaining fully differentiable. Because D-Struct remains differentiable, our method can be easily adopted in existing differentiable architectures, as was previously done with NOTEARS. In our experiments, we empirically validate D-Struct with respect to edge accuracy and structural Hamming distance in a variety of settings.
翻译:有向无环图(DAGs)在其结构中编码了大量关于特定分布的信息。然而,推断这些结构所需的计算量通常与变量数量呈超指数关系,因为推断需要遍历组合数巨大的潜在结构空间。直到近期进展使得通过可微分指标搜索该空间成为可能,从而大幅缩短了搜索时间。尽管这一名为NOTEARS的技术被广泛视为DAG发现领域的开创性工作,但它为了可微性放弃了一个重要性质:可迁移性。若要实现可迁移,从一个数据集发现的结构必须适用于同一领域的另一个数据集。我们提出了D-Struct,通过新颖的架构和损失函数在保持完全可微性的同时恢复了所发现结构的可迁移性。由于D-Struct保持可微性,我们的方法可以像此前NOTEARS那样轻松集成到现有可微架构中。在实验中,我们从边准确度和结构汉明距离等多个维度对D-Struct进行了实证验证。