FisheyeDetNet: Object Detection on Fisheye Surround View Camera Systems for Automated Driving

Object detection is a mature problem in autonomous driving with pedestrian detection being one of the first deployed algorithms. It has been comprehensively studied in the literature. However, object detection is relatively less explored for fisheye cameras used for surround-view near field sensing. The standard bounding box representation fails in fisheye cameras due to heavy radial distortion, particularly in the periphery. To mitigate this, we explore extending the standard object detection output representation of bounding box. We design rotated bounding boxes, ellipse, generic polygon as polar arc/angle representations and define an instance segmentation mIOU metric to analyze these representations. The proposed model FisheyeDetNet with polygon outperforms others and achieves a mAP score of 49.5 % on Valeo fisheye surround-view dataset for automated driving applications. This dataset has 60K images captured from 4 surround-view cameras across Europe, North America and Asia. To the best of our knowledge, this is the first detailed study on object detection on fisheye cameras for autonomous driving scenarios.

翻译：目标检测是自动驾驶中一个成熟的问题，其中行人检测是最早部署的算法之一，已在文献中得到全面研究。然而，用于环绕视图近场感知的鱼眼摄像机在目标检测方面的探索相对较少。由于严重的径向畸变，特别是在图像边缘区域，标准边界框表示在鱼眼相机中效果不佳。为解决这一问题，我们探索扩展标准目标检测输出中的边界框表示。我们设计了旋转边界框、椭圆、通用多边形作为极坐标弧/角度表示，并定义实例分割mIOU指标来分析这些表示。所提出的FisheyeDetNet模型采用多边形表示，性能优于其他方法，在用于自动驾驶应用的Valeo鱼眼环绕视图数据集上实现了49.5%的mAP得分。该数据集包含来自欧洲、北美和亚洲四个环绕视图摄像头采集的60K张图像。据我们所知，这是针对自动驾驶场景下鱼眼相机目标检测的首次详细研究。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日