ActiveAnno3D - An Active Learning Framework for Multi-Modal 3D Object Detection

The curation of large-scale datasets is still costly and requires much time and resources. Data is often manually labeled, and the challenge of creating high-quality datasets remains. In this work, we fill the research gap using active learning for multi-modal 3D object detection. We propose ActiveAnno3D, an active learning framework to select data samples for labeling that are of maximum informativeness for training. We explore various continuous training methods and integrate the most efficient method regarding computational demand and detection performance. Furthermore, we perform extensive experiments and ablation studies with BEVFusion and PV-RCNN on the nuScenes and TUM Traffic Intersection dataset. We show that we can achieve almost the same performance with PV-RCNN and the entropy-based query strategy when using only half of the training data (77.25 mAP compared to 83.50 mAP) of the TUM Traffic Intersection dataset. BEVFusion achieved an mAP of 64.31 when using half of the training data and 75.0 mAP when using the complete nuScenes dataset. We integrate our active learning framework into the proAnno labeling tool to enable AI-assisted data selection and labeling and minimize the labeling costs. Finally, we provide code, weights, and visualization results on our website: https://active3d-framework.github.io/active3d-framework.

翻译：大规模数据集的构建成本依然高昂，且需要耗费大量时间与资源。数据通常依赖人工标注，如何创建高质量数据集仍是挑战。本文利用主动学习方法填补多模态三维目标检测领域的研究空白。我们提出ActiveAnno3D——一种用于筛选高信息密度训练样本的主动学习框架。通过探索多种连续训练方法，我们整合了计算效率与检测性能最优的方案。此外，我们在nuScenes与TUM交通路口数据集上，结合BEVFusion与PV-RCNN开展大量实验与消融研究。实验表明，在TUM交通路口数据集中，使用仅一半训练数据时（平均精度77.25%对比83.50%），PV-RCNN结合基于熵的查询策略即可取得相近性能；而BEVFusion在利用半数训练数据时获得64.31%平均精度，使用完整nuScenes数据集时达到75.0%。我们将主动学习框架集成至proAnno标注工具中，实现AI辅助的数据筛选与标注，降低标注成本。最后，我们在网站提供了代码、权重及可视化结果：https://active3d-framework.github.io/active3d-framework。

相关内容

主动学习

关注 243

主动学习是机器学习（更普遍的说是人工智能）的一个子领域，在统计学领域也叫查询学习、最优实验设计。“学习模块”和“选择策略”是主动学习算法的2个基本且重要的模块。主动学习是“一种学习方法，在这种方法中，学生会主动或体验性地参与学习过程，并且根据学生的参与程度，有不同程度的主动学习。” （Bonwell＆Eison 1991）Bonwell＆Eison（1991）指出：“学生除了被动地听课以外，还从事其他活动。” 在高等教育研究协会（ASHE）的一份报告中，作者讨论了各种促进主动学习的方法。他们引用了一些文献，这些文献表明学生不仅要做听，还必须做更多的事情才能学习。他们必须阅读，写作，讨论并参与解决问题。此过程涉及三个学习领域，即知识，技能和态度（KSA）。这种学习行为分类法可以被认为是“学习过程的目标”。特别是，学生必须从事诸如分析，综合和评估之类的高级思维任务。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日