Object detection is a mature problem in autonomous driving with pedestrian detection being one of the first deployed algorithms. It has been comprehensively studied in the literature. However, object detection is relatively less explored for fisheye cameras used for surround-view near field sensing. The standard bounding box representation fails in fisheye cameras due to heavy radial distortion, particularly in the periphery. To mitigate this, we explore extending the standard object detection output representation of bounding box. We design rotated bounding boxes, ellipse, generic polygon as polar arc/angle representations and define an instance segmentation mIOU metric to analyze these representations. The proposed model FisheyeDetNet with polygon outperforms others and achieves a mAP score of 49.5 % on Valeo fisheye surround-view dataset for automated driving applications. This dataset has 60K images captured from 4 surround-view cameras across Europe, North America and Asia. To the best of our knowledge, this is the first detailed study on object detection on fisheye cameras for autonomous driving scenarios.
翻译:目标检测是自动驾驶中一个成熟的问题,其中行人检测是最早部署的算法之一,已在文献中得到全面研究。然而,用于环绕视图近场感知的鱼眼摄像机在目标检测方面的探索相对较少。由于严重的径向畸变,特别是在图像边缘区域,标准边界框表示在鱼眼相机中效果不佳。为解决这一问题,我们探索扩展标准目标检测输出中的边界框表示。我们设计了旋转边界框、椭圆、通用多边形作为极坐标弧/角度表示,并定义实例分割mIOU指标来分析这些表示。所提出的FisheyeDetNet模型采用多边形表示,性能优于其他方法,在用于自动驾驶应用的Valeo鱼眼环绕视图数据集上实现了49.5%的mAP得分。该数据集包含来自欧洲、北美和亚洲四个环绕视图摄像头采集的60K张图像。据我们所知,这是针对自动驾驶场景下鱼眼相机目标检测的首次详细研究。