Implementation of a perception system for autonomous vehicles using a detection-segmentation network in SoC FPGA

Perception and control systems for autonomous vehicles are an active area of scientific and industrial research. These solutions should be characterised by high efficiency in recognising obstacles and other environmental elements in different road conditions, real-time capability, and energy efficiency. Achieving such functionality requires an appropriate algorithm and a suitable computing platform. In this paper, we have used the MultiTaskV3 detection-segmentation network as the basis for a perception system that can perform both functionalities within a single architecture. It was appropriately trained, quantised, and implemented on the AMD Xilinx Kria KV260 Vision AI embedded platform. By using this device, it was possible to parallelise and accelerate the computations. Furthermore, the whole system consumes relatively little power compared to a CPU-based implementation (an average of 5 watts, compared to the minimum of 55 watts for weaker CPUs, and the small size (119mm x 140mm x 36mm) of the platform allows it to be used in devices where the amount of space available is limited. It also achieves an accuracy higher than 97% of the mAP (mean average precision) for object detection and above 90% of the mIoU (mean intersection over union) for image segmentation. The article also details the design of the Mecanum wheel vehicle, which was used to test the proposed solution in a mock-up city.

翻译：自动驾驶车辆感知与控制系统是科学和工业研究中的活跃领域。此类解决方案需具备以下特性：在不同道路条件下高效识别障碍物及其他环境要素的能力、实时处理能力以及能源效率。实现上述功能需要合适的算法和计算平台。本文采用MultiTaskV3检测-分割网络作为感知系统的基础，该网络可在单一架构中同时实现两种功能。我们对网络进行了适当训练、量化处理，并在AMD Xilinx Kria KV260 Vision AI嵌入式平台上实现。通过使用该设备，我们实现了计算的并行化与加速。此外，与基于CPU的实现方案相比，整个系统功耗较低（平均5瓦，而最低功耗的CPU至少为55瓦），且平台小巧的尺寸（119mm x 140mm x 36mm）使其可应用于空间受限的设备。系统在目标检测方面实现了超过97%的mAP（平均精度均值）准确率，在图像分割方面实现了超过90%的mIoU（平均交并比）。本文还详细介绍了用于在模拟城市环境中测试所提方案的麦克纳姆轮车辆设计。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日