Ev-Edge: Efficient Execution of Event-based Vision Algorithms on Commodity Edge Platforms

Event cameras have emerged as a promising sensing modality for autonomous navigation systems, owing to their high temporal resolution, high dynamic range and negligible motion blur. To process the asynchronous temporal event streams from such sensors, recent research has shown that a mix of Artificial Neural Networks (ANNs), Spiking Neural Networks (SNNs) as well as hybrid SNN-ANN algorithms are necessary to achieve high accuracies across a range of perception tasks. However, we observe that executing such workloads on commodity edge platforms which feature heterogeneous processing elements such as CPUs, GPUs and neural accelerators results in inferior performance. This is due to the mismatch between the irregular nature of event streams and diverse characteristics of algorithms on the one hand and the underlying hardware platform on the other. We propose Ev-Edge, a framework that contains three key optimizations to boost the performance of event-based vision systems on edge platforms: (1) An Event2Sparse Frame converter directly transforms raw event streams into sparse frames, enabling the use of sparse libraries with minimal encoding overheads (2) A Dynamic Sparse Frame Aggregator merges sparse frames at runtime by trading off the temporal granularity of events and computational demand thereby improving hardware utilization (3) A Network Mapper maps concurrently executing tasks to different processing elements while also selecting layer precision by considering both compute and communication overheads. On several state-of-art networks for a range of autonomous navigation tasks, Ev-Edge achieves 1.28x-2.05x improvements in latency and 1.23x-2.15x in energy over an all-GPU implementation on the NVIDIA Jetson Xavier AGX platform for single-task execution scenarios. Ev-Edge also achieves 1.43x-1.81x latency improvements over round-robin scheduling methods in multi-task execution scenarios.

翻译：事件相机凭借其高时间分辨率、高动态范围与极低运动模糊特性，已成为自主导航系统极具前景的传感模态。为处理此类传感器产生的异步时间事件流，近期研究表明需融合人工神经网络（ANN）、脉冲神经网络（SNN）及混合SNN-ANN算法，才能在多类感知任务中实现高精度。然而，我们观察到在配备CPU、GPU与神经加速器等异构处理单元的商品化边缘平台上执行此类工作负载时，性能表现不佳。其根源在于事件流的非规则性、算法多样性特征与底层硬件平台之间存在失配。为此，我们提出Ev-Edge框架，包含三项关键优化以提升边缘平台事件驱动视觉系统性能：（1）事件转稀疏帧转换器（Event2Sparse Frame），将原始事件流直接转换为稀疏帧，以最小编码开销实现稀疏库调用；（2）动态稀疏帧聚合器（Dynamic Sparse Frame Aggregator），通过权衡事件时间粒度与计算需求运行时合并稀疏帧，提升硬件利用率；（3）网络映射器（Network Mapper），在考虑计算与通信开销的前提下，将并行执行任务映射至不同处理单元并选择层精度。在面向多类自主导航任务的最优网络架构上，Ev-Edge于NVIDIA Jetson Xavier AGX平台单任务执行场景中，相较纯GPU实现实现1.28倍至2.05倍延迟优化与1.23倍至2.15倍能效提升；在多任务执行场景中，相较轮询调度方法实现1.43倍至1.81倍延迟改进。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日