MaGNAS: A Mapping-Aware Graph Neural Architecture Search Framework for Heterogeneous MPSoC Deployment

from arxiv, This article appears as part of the ESWEEK-TECS special issue and was presented in the International Conference on Compilers, Architectures, and Synthesis for Embedded Systems (CASES), 2023

Graph Neural Networks (GNNs) are becoming increasingly popular for vision-based applications due to their intrinsic capacity in modeling structural and contextual relations between various parts of an image frame. On another front, the rising popularity of deep vision-based applications at the edge has been facilitated by the recent advancements in heterogeneous multi-processor Systems on Chips (MPSoCs) that enable inference under real-time, stringent execution requirements. By extension, GNNs employed for vision-based applications must adhere to the same execution requirements. Yet contrary to typical deep neural networks, the irregular flow of graph learning operations poses a challenge to running GNNs on such heterogeneous MPSoC platforms. In this paper, we propose a novel unified design-mapping approach for efficient processing of vision GNN workloads on heterogeneous MPSoC platforms. Particularly, we develop MaGNAS, a mapping-aware Graph Neural Architecture Search framework. MaGNAS proposes a GNN architectural design space coupled with prospective mapping options on a heterogeneous SoC to identify model architectures that maximize on-device resource efficiency. To achieve this, MaGNAS employs a two-tier evolutionary search to identify optimal GNNs and mapping pairings that yield the best performance trade-offs. Through designing a supernet derived from the recent Vision GNN (ViG) architecture, we conducted experiments on four (04) state-of-the-art vision datasets using both (i) a real hardware SoC platform (NVIDIA Xavier AGX) and (ii) a performance/cost model simulator for DNN accelerators. Our experimental results demonstrate that MaGNAS is able to provide 1.57x latency speedup and is 3.38x more energy-efficient for several vision datasets executed on the Xavier MPSoC vs. the GPU-only deployment while sustaining an average 0.11% accuracy reduction from the baseline.

翻译：图神经网络（GNN）因其在建模图像帧各部分间结构与上下文关系方面的内在能力，正日益广泛用于视觉应用。另一方面，异构多处理器片上系统（MPSoC）的最新进展使得在边缘端运行新型深度视觉应用成为可能，这些系统能在实时、严格的执行需求下实现推理。由此延伸，面向视觉应用的GNN也必须遵循相同的执行要求。然而，与典型深度神经网络不同，图学习操作的不规则流程为在异构MPSoC平台上运行GNN带来了挑战。本文提出一种新颖的统一设计-映射方法，以在异构MPSoC平台上高效处理视觉GNN工作负载。具体而言，我们开发了MaGNAS——一种映射感知型图神经架构搜索框架。MaGNAS将GNN架构设计空间与异构SoC上的潜在映射选项相结合，旨在识别能最大化设备端资源效率的模型架构。为此，MaGNAS采用双层进化搜索方法，以识别产生最佳性能权衡的最优GNN与映射配对。通过设计基于最新Vision GNN（ViG）架构的超网络，我们分别在（i）真实硬件SoC平台（NVIDIA Xavier AGX）和（ii）面向DNN加速器的性能/成本模型模拟器上，针对四个（04）主流视觉数据集进行了实验。实验结果表明，在Xavier MPSoC上执行多个视觉数据集时，与仅使用GPU的部署方案相比，MaGNAS可实现1.57倍的延迟加速，能效提升3.38倍，同时较基线仅平均降低0.11%的精度。