Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks

The fusion of raw features from multiple sensors on an autonomous vehicle to create a Bird's Eye View (BEV) representation is crucial for planning and control systems. There is growing interest in using deep learning models for BEV semantic segmentation. Anticipating segmentation errors and improving the explainability of DNNs is essential for autonomous driving, yet it is under-studied. This paper introduces a benchmark for predictive uncertainty quantification in BEV segmentation. The benchmark assesses various approaches across three popular datasets using two representative backbones and focuses on the effectiveness of predicted uncertainty in identifying misclassified and out-of-distribution (OOD) pixels, as well as calibration. Empirical findings highlight the challenges in uncertainty quantification. Our results find that evidential deep learning based approaches show the most promise by efficiently quantifying aleatoric and epistemic uncertainty. We propose the Uncertainty-Focal-Cross-Entropy (UFCE) loss, designed for highly imbalanced data, which consistently improves the segmentation quality and calibration. Additionally, we introduce a vacuity-scaled regularization term that enhances the model's focus on high uncertainty pixels, improving epistemic uncertainty quantification.

翻译：自动驾驶车辆通过融合多个传感器的原始特征生成鸟瞰图表示，这对于规划与控制系统至关重要。目前，利用深度学习模型进行鸟瞰图语义分割的研究日益增多。然而，预测分割错误并提升深度神经网络的可解释性对于自动驾驶至关重要，但相关研究仍显不足。本文提出了一个鸟瞰图分割中预测不确定性量化的基准。该基准使用两种代表性骨干网络，在三个常用数据集上评估了多种方法，重点关注预测不确定性在识别误分类像素、分布外像素以及校准方面的有效性。实证结果突显了不确定性量化所面临的挑战。研究发现，基于证据深度学习的方法通过有效量化任意不确定性和认知不确定性，展现出最大的潜力。我们提出了专为高度不平衡数据设计的"不确定性-焦点-交叉熵"损失函数，该函数持续提升了分割质量与校准性能。此外，我们引入了一个基于空值缩放的规范化项，增强了模型对高不确定性像素的关注，从而改善了认知不确定性的量化。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

14+阅读 · 2022年3月12日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日