In contrastive learning, the choice of ``view'' controls the information that the representation captures and influences the performance of the model. However, leading graph contrastive learning methods generally produce views via random corruption or learning, which could lead to the loss of essential information and alteration of semantic information. An anchor view that maintains the essential information of input graphs for contrastive learning has been hardly investigated. In this paper, based on the theory of graph information bottleneck, we deduce the definition of this anchor view; put differently, \textit{the anchor view with essential information of input graph is supposed to have the minimal structural uncertainty}. Furthermore, guided by structural entropy, we implement the anchor view, termed \textbf{SEGA}, for graph contrastive learning. We extensively validate the proposed anchor view on various benchmarks regarding graph classification under unsupervised, semi-supervised, and transfer learning and achieve significant performance boosts compared to the state-of-the-art methods.
翻译:在对比学习中,“视图”的选择决定了表征所捕获的信息并影响模型性能。然而,主流的图对比学习方法通常通过随机破坏或学习生成视图,这可能导致关键信息丢失和语义信息改变。一种能够保留输入图关键信息用于对比学习的锚视图尚未得到充分研究。本文基于图信息瓶颈理论,推导出该锚视图的定义;换言之,**具有输入图关键信息的锚视图应具备最小的结构不确定性**。此外,我们以结构熵为指导,实现了名为**SEGA**的锚视图,用于图对比学习。我们在无监督、半监督和迁移学习场景下的图分类基准上广泛验证了所提出的锚视图,相比当前最优方法取得了显著的性能提升。