Learning to Count Isomorphisms with Graph Neural Networks

Subgraph isomorphism counting is an important problem on graphs, as many graph-based tasks exploit recurring subgraph patterns. Classical methods usually boil down to a backtracking framework that must navigate a huge search space at prohibitive computational cost. Some recent studies turn to graph neural networks (GNNs) to learn low-dimensional representations of both the query and input graphs, in order to predict the number of subgraph isomorphisms on the input graph. However, typical GNNs employ a node-centric message passing scheme that receives and aggregates messages on nodes, which is inadequate for the complex structure matching that isomorphism counting requires. Moreover, on an input graph, the space of possible query graphs is enormous, and different parts of the input graph will be triggered to match different queries. Thus, expecting a fixed representation of the input graph to match diversely structured query graphs is unrealistic. In this paper, we propose a novel GNN called Count-GNN for subgraph isomorphism counting to address these challenges. At the edge level, given that an edge is an atomic unit for encoding graph structures, we propose an edge-centric message passing scheme, where messages on edges are propagated and aggregated based on edge adjacency to preserve fine-grained structural information. At the graph level, we modulate the input graph representation conditioned on the query, so that the input graph can be adapted to each query individually to improve their matching. Finally, we conduct extensive experiments on a number of benchmark datasets to demonstrate the superior performance of Count-GNN.
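To make the two ideas in the abstract concrete, the following is a minimal, untrained sketch in plain Python, not the Count-GNN architecture itself. It assumes directed graphs and one particular notion of edge adjacency (edge (u, v) receives messages from every edge (w, u) that ends where it starts); it replaces learned layers with a plain sum and a residual add; and it takes the modulation parameters `gamma` and `beta` as given, whereas a real model would presumably compute them from the query representation. The function names `edge_message_passing`, `modulate`, and `readout` are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def edge_message_passing(edges, edge_feats, num_layers=2):
    """Propagate messages between directed edges rather than between nodes.

    Assumption: edge (u, v) aggregates the states of all edges (w, u).
    Aggregation is a sum and the update is a residual add; a trained model
    would apply learned weights and a nonlinearity at this point.
    """
    # Index edges by head node so predecessor edges can be looked up by tail.
    incoming = defaultdict(list)
    for idx, (_, head) in enumerate(edges):
        incoming[head].append(idx)

    h = [list(f) for f in edge_feats]
    dim = len(h[0]) if h else 0
    for _ in range(num_layers):
        new_h = []
        for (tail, _), own in zip(edges, h):
            agg = [0.0] * dim
            for j in incoming[tail]:
                for k in range(dim):
                    agg[k] += h[j][k]
            new_h.append([o + a for o, a in zip(own, agg)])
        h = new_h
    return h


def modulate(edge_states, gamma, beta):
    """Scale and shift every edge state of the input graph.

    Assumption: gamma and beta stand in for values derived from the query,
    so the same input graph gets a different representation per query.
    """
    return [[g * s + b for s, g, b in zip(state, gamma, beta)]
            for state in edge_states]


def readout(edge_states):
    """Sum edge states into a single graph-level vector."""
    dim = len(edge_states[0])
    return [sum(state[k] for state in edge_states) for k in range(dim)]


# Directed path 0 -> 1 -> 2 with one scalar feature per edge.
path_edges = [(0, 1), (1, 2)]
path_states = edge_message_passing(path_edges, [[1.0], [2.0]], num_layers=2)

# Two hypothetical queries yield two different views of the same input graph.
view_a = readout(modulate(path_states, gamma=[1.0], beta=[0.0]))
view_b = readout(modulate(path_states, gamma=[2.0], beta=[0.5]))
```

In this toy setup the edge (1, 2) accumulates information from its predecessor edge (0, 1) at every layer, while (0, 1) has no predecessor and keeps its initial state. The two readouts differ only because of the modulation step, which is the point of conditioning the input graph on the query; a counting model would feed such a query-adapted vector into a trained prediction head.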
