GNNFlow: A Distributed Framework for Continuous Temporal GNN Learning on Dynamic Graphs

Graph Neural Networks (GNNs) play a crucial role in various fields. However, most existing deep graph learning frameworks assume pre-stored static graphs and do not support training on graph streams. In contrast, many real-world graphs are dynamic and contain time domain information. We introduce GNNFlow, a distributed framework that enables efficient continuous temporal graph representation learning on dynamic graphs on multi-GPU machines. GNNFlow introduces an adaptive time-indexed block-based data structure that effectively balances memory usage with graph update and sampling operation efficiency. It features a hybrid GPU-CPU graph data placement for rapid GPU-based temporal neighborhood sampling and kernel optimizations for enhanced sampling processes. A dynamic GPU cache for node and edge features is developed to maximize cache hit rates through reuse and restoration strategies. GNNFlow supports distributed training across multiple machines with static scheduling to ensure load balance. We implement GNNFlow based on DGL and PyTorch. Our experimental results show that GNNFlow provides up to 21.1x faster continuous learning than existing systems.
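To make the time-indexed, block-based idea concrete, here is a minimal single-process sketch in Python. It is not GNNFlow's implementation or API: the names `EdgeBlock`, `TemporalBlockGraph`, `add_edge`, and `sample_recent` are hypothetical, the block capacity is fixed rather than adaptive, and GPU placement, feature caching, and distribution are not modeled. It assumes each node's edges arrive in non-decreasing timestamp order, so every block stays sorted and a sampler can skip whole blocks that are too new.

```python
from bisect import bisect_left
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class EdgeBlock:
    """A fixed-capacity block of one node's edges, stored in timestamp order."""
    capacity: int
    neighbors: List[int] = field(default_factory=list)
    timestamps: List[float] = field(default_factory=list)

    def is_full(self) -> bool:
        return len(self.neighbors) >= self.capacity

    @property
    def start_time(self) -> float:
        # Earliest timestamp in the block; blocks are never left empty.
        return self.timestamps[0]


class TemporalBlockGraph:
    """Toy per-node list of time-ordered edge blocks (hypothetical sketch)."""

    def __init__(self, block_capacity: int = 4) -> None:
        self.block_capacity = block_capacity
        self.blocks: Dict[int, List[EdgeBlock]] = {}

    def add_edge(self, src: int, dst: int, ts: float) -> None:
        """Append an edge; assumes per-node timestamps are non-decreasing."""
        node_blocks = self.blocks.setdefault(src, [])
        if not node_blocks or node_blocks[-1].is_full():
            # Allocate a new block instead of resizing an existing one.
            node_blocks.append(EdgeBlock(self.block_capacity))
        block = node_blocks[-1]
        block.neighbors.append(dst)
        block.timestamps.append(ts)

    def sample_recent(self, node: int, before: float, k: int) -> List[Tuple[int, float]]:
        """Return up to k most recent (neighbor, timestamp) pairs with timestamp < before."""
        result: List[Tuple[int, float]] = []
        if k <= 0:
            return result
        # Walk blocks from newest to oldest.
        for block in reversed(self.blocks.get(node, [])):
            if block.start_time >= before:
                continue  # every edge in this block is too new
            # First index whose timestamp is >= before; earlier indices qualify.
            end = bisect_left(block.timestamps, before)
            for i in range(end - 1, -1, -1):
                result.append((block.neighbors[i], block.timestamps[i]))
                if len(result) == k:
                    return result
        return result


graph = TemporalBlockGraph(block_capacity=2)
for dst, ts in [(1, 1.0), (2, 2.0), (3, 3.0), (4, 4.0), (5, 5.0)]:
    graph.add_edge(0, dst, ts)

recent = graph.sample_recent(0, before=4.0, k=2)
```

In this sketch, larger blocks mean fewer allocations and faster scans but more unused slots for low-degree nodes, which is the kind of memory versus update and sampling trade-off the abstract says the adaptive structure balances.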

