Temporal Interaction Graphs (TIGs) are widely employed to model intricate real-world systems such as financial systems and social networks. To capture the dynamism and interdependencies of nodes, existing TIG embedding models need to process edges sequentially and chronologically. However, this requirement prevents it from being processed in parallel and struggle to accommodate burgeoning data volumes to GPU. Consequently, many large-scale temporal interaction graphs are confined to CPU processing. Furthermore, a generalized GPU scaling and acceleration approach remains unavailable. To facilitate large-scale TIGs' implementation on GPUs for acceleration, we introduce a novel training approach namely Streaming Edge Partitioning and Parallel Acceleration for Temporal Interaction Graph Embedding (SPEED). The SPEED is comprised of a Streaming Edge Partitioning Component (SEP) which addresses space overhead issue by assigning fewer nodes to each GPU, and a Parallel Acceleration Component (PAC) which enables simultaneous training of different sub-graphs, addressing time overhead issue. Our method can achieve a good balance in computing resources, computing time, and downstream task performance. Empirical validation across 7 real-world datasets demonstrates the potential to expedite training speeds by a factor of up to 19.29x. Simultaneously, resource consumption of a single-GPU can be diminished by up to 69%, thus enabling the multiple GPU-based training and acceleration encompassing millions of nodes and billions of edges. Furthermore, our approach also maintains its competitiveness in downstream tasks.
翻译:时序交互图(Temporal Interaction Graphs, TIGs)广泛应用于金融系统、社交网络等复杂真实系统的建模。为捕捉节点的动态性与相互依赖性,现有TIG嵌入模型需按时间顺序依次处理边,但这种顺序处理机制阻碍了并行计算,且难以适应数据量激增对GPU显存的压力,导致大规模时序交互图局限于CPU处理。目前尚缺乏通用的GPU扩展与加速方案。为实现大规模TIG在GPU上的高效加速,本文提出一种新型训练方法——时序交互图嵌入的流式边划分与并行加速(SPEED)。该方法包含两个核心组件:流式边划分组件(SEP)通过为每块GPU分配更少节点以解决空间开销问题;并行加速组件(PAC)通过支持不同子图同步训练解决时间开销问题。本方法能在计算资源、计算时间与下游任务性能间实现良好平衡。在7个真实数据集上的实验表明,该方法可将训练速度提升最高19.29倍,同时单GPU资源消耗降低最高69%,从而支持涵盖数百万节点和数十亿边的多GPU训练与加速,且在下游任务中仍保持竞争力。