How Expressive are Graph Neural Networks in Recommendation?

Graph Neural Networks (GNNs) have demonstrated superior performance on various graph learning tasks, including recommendation, where they leverage user-item collaborative filtering signals in graphs. However, theoretical formulations of their capability are scarce, despite their empirical effectiveness in state-of-the-art recommender models. Recently, research has explored the expressiveness of GNNs in general, demonstrating that message passing GNNs are at most as powerful as the Weisfeiler-Lehman test, and that GNNs combined with random node initialization are universal. Nevertheless, the concept of "expressiveness" for GNNs remains vaguely defined. Most existing works adopt the graph isomorphism test as the metric of expressiveness, but this graph-level task may not effectively assess a model's ability in recommendation, where the objective is to distinguish nodes of different closeness. In this paper, we provide a comprehensive theoretical analysis of the expressiveness of GNNs in recommendation, considering three levels of expressiveness metrics: graph isomorphism (graph-level), node automorphism (node-level), and topological closeness (link-level). We propose the topological closeness metric to evaluate GNNs' ability to capture the structural distance between nodes, which aligns closely with the objective of recommendation. To validate the effectiveness of this new metric in evaluating recommendation performance, we introduce a learning-less GNN algorithm that is optimal on the new metric and can be optimal on the node-level metric with suitable modification. We conduct extensive experiments comparing the proposed algorithm against various types of state-of-the-art GNN models to explore the explainability of the new metric in the recommendation task. For reproducibility, implementation codes are available at https://github.com/HKUDS/GTE.

翻译：图神经网络（GNN）在各种图学习任务（包括推荐）中展现出卓越性能，其通过在图中利用用户-物品协同过滤信号发挥作用。然而，尽管GNN在最新推荐模型中具有实证有效性，但其能力的理论形式化表述仍十分匮乏。近期，研究探讨了GNN的通用表达能力，证明消息传递型GNN至多与Weisfeiler-Lehman检验能力相当，而结合随机节点初始化的GNN具有普适性。但GNN"表达能力"的概念仍定义模糊。现有大多数研究采用图同构检验作为表达能力度量指标，但这种图级任务可能无法有效评估模型在推荐中的能力——推荐任务的目标是区分不同紧密程度的节点。本文从三个层面的表达能力度量指标出发，对GNN在推荐中的表达能力进行了全面的理论分析：图同构（图级）、节点自同构（节点级）和拓扑接近度（边级）。我们提出拓扑接近度度量指标，用于评估GNN捕获节点间结构距离的能力，这与推荐目标高度契合。为验证新指标在评估推荐性能方面的有效性，我们引入了一种无学习型GNN算法，该算法在新指标上最优，且经过适当修改后可在节点级指标上达到最优。通过大量实验，我们将所提算法与各类最新GNN模型进行对比，探究新指标在推荐任务中的可解释性。为保障可复现性，实现代码已开源至https://github.com/HKUDS/GTE。