A Comprehensive Survey on Deep Graph Representation Learning

Wei Ju,Zheng Fang,Yiyang Gu,Zequn Liu,Qingqing Long,Ziyue Qiao,Yifang Qin,Jianhao Shen,Fang Sun,Zhiping Xiao,Junwei Yang,Jingyang Yuan,Yusheng Zhao,Yifan Wang,Xiao Luo,Ming Zhang

from arxiv, Accepted by Neural Networks 2024

Graph representation learning aims to effectively encode high-dimensional sparse graph-structured data into low-dimensional dense vectors, which is a fundamental task that has been widely studied in a range of fields, including machine learning and data mining. Classic graph embedding methods follow the basic idea that the embedding vectors of interconnected nodes in the graph can still maintain a relatively close distance, thereby preserving the structural information between the nodes in the graph. However, this is sub-optimal due to: (i) traditional methods have limited model capacity which limits the learning performance; (ii) existing techniques typically rely on unsupervised learning strategies and fail to couple with the latest learning paradigms; (iii) representation learning and downstream tasks are dependent on each other which should be jointly enhanced. With the remarkable success of deep learning, deep graph representation learning has shown great potential and advantages over shallow (traditional) methods, there exist a large number of deep graph representation learning techniques have been proposed in the past decade, especially graph neural networks. In this survey, we conduct a comprehensive survey on current deep graph representation learning algorithms by proposing a new taxonomy of existing state-of-the-art literature. Specifically, we systematically summarize the essential components of graph representation learning and categorize existing approaches by the ways of graph neural network architectures and the most recent advanced learning paradigms. Moreover, this survey also provides the practical and promising applications of deep graph representation learning. Last but not least, we state new perspectives and suggest challenging directions which deserve further investigations in the future.

翻译：图表示学习旨在将高维稀疏的图结构数据有效编码为低维稠密向量，这是一项在机器学习与数据挖掘等多个领域被广泛研究的基础任务。经典图嵌入方法遵循基本思想：图中相互连接节点的嵌入向量能保持相对较近的距离，从而保留节点间的结构信息。然而，这类方法存在次优性：（i）传统方法模型容量有限，限制了学习性能；（ii）现有技术通常依赖无监督学习策略，未能与最新学习范式结合；（iii）表示学习与下游任务相互依赖，需协同增强。随着深度学习的显著成功，深度图表示学习已展现出超越浅层（传统）方法的巨大潜力与优势。过去十年间，大量深度图表示学习技术被提出，尤其是图神经网络。在本综述中，我们通过提出一种针对现有最先进文献的新分类体系，对当前深度图表示学习算法进行了全面梳理。具体而言，我们系统总结了图表示学习的核心组件，并根据图神经网络架构类型及最新前沿学习范式对现有方法进行了分类。此外，本综述还介绍了深度图表示学习的实际应用与前景。最后，我们提出了新视角及未来值得探索的挑战性研究方向。