Graph self-supervised learning has sparked a research surge in training informative representations without accessing any labeled data. However, our understanding of graph self-supervised learning remains limited, and the inherent relationships between various self-supervised tasks are still unexplored. Our paper aims to provide a fresh understanding of graph self-supervised learning based on task correlations. Specifically, we evaluate the performance of the representations trained by one specific task on other tasks and define correlation values to quantify task correlations. Through this process, we unveil the task correlations between various self-supervised tasks and can measure their expressive capabilities, which are closely related to downstream performance. By analyzing the correlation values between tasks across various datasets, we reveal the complexity of task correlations and the limitations of existing multi-task learning methods. To obtain more capable representations, we propose Graph Task Correlation Modeling (GraphTCM) to illustrate the task correlations and utilize it to enhance graph self-supervised training. The experimental results indicate that our method significantly outperforms existing methods across various downstream tasks.
翻译:图自监督学习在无需任何标注数据的情况下训练信息表示方面引发了研究热潮。然而,我们对图自监督学习的理解仍然有限,各种自监督任务之间的内在关系尚未得到探索。本文旨在基于任务相关性提供对图自监督学习的新理解。具体而言,我们评估了由特定任务训练的表示在其他任务上的性能,并定义了相关值来量化任务相关性。通过这一过程,我们揭示了各种自监督任务之间的任务相关性,并能够衡量它们的表达能力,这与下游性能密切相关。通过分析不同数据集上的任务间相关值,我们揭示了任务相关性的复杂性以及现有多任务学习方法的局限性。为了获得更强大的表示,我们提出了图任务相关性建模(GraphTCM)来阐释任务相关性,并利用它增强图自监督训练。实验结果表明,我们的方法在各种下游任务上显著优于现有方法。