MEDL-U: Uncertainty-aware 3D Automatic Annotation based on Evidential Deep Learning

Advancements in deep learning-based 3D object detection necessitate the availability of large-scale datasets. However, this requirement introduces the challenge of manual annotation, which is often both burdensome and time-consuming. To tackle this issue, the literature has seen the emergence of several weakly supervised frameworks for 3D object detection which can automatically generate pseudo labels for unlabeled data. Nevertheless, these generated pseudo labels contain noise and are not as accurate as those labeled by humans. In this paper, we present the first approach that addresses the inherent ambiguities present in pseudo labels by introducing an Evidential Deep Learning (EDL) based uncertainty estimation framework. Specifically, we propose MEDL-U, an EDL framework based on MTrans, which not only generates pseudo labels but also quantifies the associated uncertainties. However, applying EDL to 3D object detection presents three primary challenges: (1) relatively lower pseudolabel quality in comparison to other autolabelers; (2) excessively high evidential uncertainty estimates; and (3) lack of clear interpretability and effective utilization of uncertainties for downstream tasks. We tackle these issues through the introduction of an uncertainty-aware IoU-based loss, an evidence-aware multi-task loss function, and the implementation of a post-processing stage for uncertainty refinement. Our experimental results demonstrate that probabilistic detectors trained using the outputs of MEDL-U surpass deterministic detectors trained using outputs from previous 3D annotators on the KITTI val set for all difficulty levels. Moreover, MEDL-U achieves state-of-the-art results on the KITTI official test set compared to existing 3D automatic annotators.

翻译：基于深度学习的三维目标检测技术的进步依赖于大规模数据集的可用性。然而，这一需求引入了人工标注的挑战，该过程通常既繁重又耗时。为解决该问题，文献中已出现多种弱监督三维目标检测框架，可自动为未标注数据生成伪标签。然而，这些生成的伪标签包含噪声，其准确度不及人工标注结果。本文首次提出通过引入基于证据深度学习（EDL）的不确定性估计框架来解决伪标签中固有的歧义性问题。具体而言，我们提出MEDL-U——一个基于MTrans的EDL框架，该框架不仅能生成伪标签，还能量化相关的不确定性。然而，将EDL应用于三维目标检测面临三大挑战：（1）伪标签质量相较于其他自动标注器较低；（2）证据不确定性估计值过高；（3）缺乏清晰的可解释性及对下游任务有效利用不确定性的方法。我们通过引入基于IoU的不确定性感知损失函数、证据感知多任务损失函数，以及实施不确定性后处理阶段来解决这些问题。实验结果表明，在KITTI验证集上，使用MEDL-U输出训练的基于概率的检测器在所有难度等级上均超越了使用先前三维标注器输出训练的确定性检测器。此外，与现有三维自动标注器相比，MEDL-U在KITTI官方测试集上取得了最先进的结果。