This paper presents GRAPHMEND, a high-level compiler technique that eliminates FX graph breaks in PyTorch 2 programs. Although PyTorch 2 introduced TorchDynamo and TorchInductor to enable just-in-time graph compilation, unresolved dynamic control flow and unsupported Python constructs often fragment models into multiple FX graphs. These fragments force frequent fallbacks to eager mode, introduce costly CPU-to-GPU synchronizations, and reduce optimization opportunities. GRAPHMEND addresses this limitation by analyzing and transforming source code before execution. Built on the Jaseci compilation framework, GRAPHMEND introduces two code transformations that remove graph breaks caused by dynamic control flow and Python side effects. This design allows PyTorch's compilation pipeline to capture larger, uninterrupted FX graphs without requiring manual refactoring by developers. Evaluation across eight Hugging Face models shows that GRAPHMEND removes the graph breaks caused by dynamic control flow and Python side effects, eliminating all breaks in six models and reducing the break count from 5 to 2 in another. On NVIDIA RTX 3090 and A40 GPUs, GRAPHMEND reduces latency by up to 75% and improves end-to-end throughput by up to 8%. These results demonstrate that high-level code transformation is an effective complement to PyTorch's dynamic JIT compilation pipeline, substantially improving both usability and performance.
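To illustrate the kind of source-level rewrite the abstract describes, the sketch below is a minimal, hypothetical example (not GRAPHMEND's actual implementation, whose transformations are specified in the paper body): an `ast.NodeTransformer` that replaces a data-dependent `if/else` assignment, a pattern that forces TorchDynamo to split the FX graph, with a single branch-free `torch.where` call that the tracer can capture in one graph. The pattern matched and the `BranchToWhere` name are illustrative assumptions.

```python
import ast

class BranchToWhere(ast.NodeTransformer):
    """Hypothetical sketch of a graph-break-removing rewrite:
    turn `if cond: x = a else: x = b` into `x = torch.where(cond, a, b)`."""

    def visit_If(self, node):
        # Match only the narrow pattern: a single assignment to the
        # same name in both branches.
        if (len(node.body) == 1 and len(node.orelse) == 1
                and isinstance(node.body[0], ast.Assign)
                and isinstance(node.orelse[0], ast.Assign)):
            then_stmt, else_stmt = node.body[0], node.orelse[0]
            t_target, e_target = then_stmt.targets[0], else_stmt.targets[0]
            if (isinstance(t_target, ast.Name) and isinstance(e_target, ast.Name)
                    and t_target.id == e_target.id):
                # Build the replacement: x = torch.where(cond, a, b)
                where_call = ast.Call(
                    func=ast.Attribute(value=ast.Name(id="torch", ctx=ast.Load()),
                                       attr="where", ctx=ast.Load()),
                    args=[node.test, then_stmt.value, else_stmt.value],
                    keywords=[])
                return ast.copy_location(
                    ast.Assign(targets=[t_target], value=where_call), node)
        return node

# A data-dependent branch that would break FX graph capture:
src = """
if scores.max() > threshold:
    out = scores * 2
else:
    out = scores - 1
"""
tree = ast.fix_missing_locations(BranchToWhere().visit(ast.parse(src)))
print(ast.unparse(tree))
# out = torch.where(scores.max() > threshold, scores * 2, scores - 1)
```

Because the rewritten statement contains no Python-level branch, the whole region can be traced into one uninterrupted FX graph instead of two fragments separated by an eager-mode fallback.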