MeanAP-Guided Reinforced Active Learning for Object Detection

Active learning offers a promising way to train high-performance models with minimal labeled data by selecting the most informative instances to label and adding them to the task learner's training set. Despite notable advances in active learning for image recognition, the metrics devised or learned to gauge the information gain of data, which are crucial for query strategy design, do not consistently align with task model performance metrics such as Mean Average Precision (MeanAP) in object detection. This paper introduces MeanAP-Guided Reinforced Active Learning for Object Detection (MAGRAL), a novel approach that uses the MeanAP of the task model directly to devise a sampling strategy, implemented as a reinforcement learning-based sampling agent. Built upon an LSTM architecture, the agent explores and selects subsequent training instances, and is optimized through policy gradient with MeanAP serving as the reward. Because computing MeanAP at each step is time-intensive, we propose fast look-up tables to expedite agent training. We assess MAGRAL on two popular benchmarks, PASCAL VOC and MS COCO, using different backbone architectures. Empirical findings show that MAGRAL outperforms recent state-of-the-art methods with substantial performance gains. MAGRAL establishes a robust baseline for reinforced active object detection, signifying its potential for advancing the field.
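The loop described in the abstract (sample a batch, score it, update the agent by policy gradient, reuse cached scores) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not specify the agent's inputs, the reward shaping, or how the look-up tables are built, so everything here is an assumption: a plain softmax policy over a tiny pool stands in for the LSTM agent, `toy_score` and `VALUES` are hypothetical stand-ins for training a detector and measuring MeanAP, and `CachedReward` is ordinary memoization used as one possible reading of "fast look-up tables".

```python
import math
import random

# Hypothetical per-sample usefulness; a real system would train a detector
# on the selected images and measure MeanAP instead.
VALUES = [0.9, 0.1, 0.8, 0.2, 0.1, 0.7]


def toy_score(subset):
    """Stand-in for an expensive MeanAP evaluation of a selected subset."""
    return sum(VALUES[i] for i in subset) / len(subset)


class CachedReward:
    """Memoize the score per selected subset (assumed look-up-table idea)."""

    def __init__(self, score_fn):
        self.score_fn = score_fn
        self.table = {}
        self.misses = 0  # number of times the expensive score was computed

    def __call__(self, subset):
        key = frozenset(subset)
        if key not in self.table:
            self.misses += 1
            self.table[key] = self.score_fn(key)
        return self.table[key]


def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_batch(logits, k, rng):
    """Sample k distinct indices; return them with the log-prob gradient."""
    available = list(range(len(logits)))
    chosen = []
    grad = [0.0] * len(logits)
    for _ in range(k):
        probs = softmax([logits[i] for i in available])
        r, cum, pick = rng.random(), 0.0, available[-1]
        for idx, p in zip(available, probs):
            cum += p
            if r < cum:
                pick = idx
                break
        # Gradient of log softmax w.r.t. logits: one-hot(pick) - probs.
        for idx, p in zip(available, probs):
            grad[idx] += (1.0 if idx == pick else 0.0) - p
        chosen.append(pick)
        available.remove(pick)
    return chosen, grad


def train_agent(pool_size, k, reward, steps=300, lr=0.5, seed=0):
    """REINFORCE with a moving-average baseline; reward comes from the cache."""
    rng = random.Random(seed)
    logits = [0.0] * pool_size
    baseline = 0.0
    for _ in range(steps):
        chosen, grad = sample_batch(logits, k, rng)
        r = reward(chosen)
        advantage = r - baseline
        baseline = 0.9 * baseline + 0.1 * r
        logits = [w + lr * advantage * g for w, g in zip(logits, grad)]
    return logits


if __name__ == "__main__":
    reward = CachedReward(toy_score)
    learned = train_agent(pool_size=len(VALUES), k=2, reward=reward)
    print("logits:", [round(x, 2) for x in learned])
    print("expensive evaluations:", reward.misses)
```

In this toy setting there are only 15 possible two-element subsets, so the cache bounds the number of expensive evaluations at 15 regardless of how many training steps the agent takes. That is the kind of saving a look-up table is meant to provide, though the paper's actual tables may be organized quite differently.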

