Active Learning for Deep Neural Networks on Edge Devices

When deploying deep neural network (DNN) applications on edge devices, continuously updating the model is important. Although updating a model with real incoming data is ideal, using all of it is not always feasible because of constraints such as labeling and communication costs. It is therefore necessary to filter and select the data used for training (i.e., active learning) on the device. In this paper, we formalize a practical active learning problem for DNNs on edge devices and propose a general task-agnostic framework that reduces the problem to stream submodular maximization. The framework is light enough to run with low computational resources, yet the quality of its solutions is theoretically guaranteed thanks to the submodular property. The framework allows data selection criteria to be configured flexibly, including criteria proposed in previous active learning studies. We evaluate our approach on both classification and object detection tasks in a practical setting that simulates a real-life scenario. The results show that the proposed framework outperforms all other methods in both tasks while running at a practical speed on real devices.
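To make the idea of selecting data from a stream by submodular maximization concrete, the following is a minimal sketch of a single-threshold rule in the style of sieve-streaming. It is not the paper's algorithm. The objective (a sum of square roots of accumulated nonnegative features, which is monotone submodular), the function names, and the parameter `opt_guess` (a guess of the best achievable objective value) are all assumptions made for illustration.

```python
import math


def marginal_gain(totals, x):
    """Gain of adding feature vector x under f(S) = sum_j sqrt(sum_{i in S} x_ij).

    Assumes all features are nonnegative, which makes f monotone submodular.
    """
    return sum(math.sqrt(t + v) - math.sqrt(t) for t, v in zip(totals, x))


def stream_select(stream, budget, opt_guess):
    """One pass over the stream, keeping at most `budget` items.

    An item is kept when its marginal gain reaches the threshold
    (opt_guess / 2 - f(S)) / (budget - |S|). `opt_guess` is a hypothetical
    tuning value; full sieve-streaming runs many guesses in parallel.
    """
    selected = []   # indices of kept items
    totals = None   # per-feature sums over the kept items
    value = 0.0     # current objective value f(S)
    for idx, x in enumerate(stream):
        if totals is None:
            totals = [0.0] * len(x)
        if len(selected) >= budget:
            break
        threshold = (opt_guess / 2 - value) / (budget - len(selected))
        gain = marginal_gain(totals, x)
        if gain >= threshold:
            selected.append(idx)
            totals = [t + v for t, v in zip(totals, x)]
            value += gain
    return selected, value


# Toy stream: items 0 and 1 are duplicates, item 2 covers a new feature.
stream = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.01, 0.01]]
selected, value = stream_select(stream, budget=2, opt_guess=3.0)
print(selected, value)
```

In this toy run the duplicate at index 1 is skipped because its marginal gain falls below the threshold, while the item at index 2 is kept because it covers a feature not yet represented. Each item costs one gain evaluation and no stored history beyond the per-feature sums, which suggests why this kind of rule can suit low-resource devices.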


Related content

Active learning


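Active learning is a subfield of machine learning (and, more broadly, of artificial intelligence); in statistics it is also known as query learning or optimal experimental design. The "learning module" and the "selection strategy" are the two basic and essential components of an active learning algorithm. In education, the same term refers to "a method of learning in which students are actively or experientially involved in the learning process and where there are different levels of active learning, depending on student involvement" (Bonwell & Eison 1991). Bonwell & Eison (1991) state that students participate "when they are doing something besides passively listening." In a report for the Association for the Study of Higher Education (ASHE), the authors discuss a variety of methods for promoting active learning. They cite literature indicating that, to learn, students must do more than just listen: they must read, write, discuss, and engage in solving problems. This process involves three learning domains: knowledge, skills, and attitudes (KSA). This taxonomy of learning behaviors can be thought of as "the goals of the learning process." In particular, students must engage in higher-order thinking tasks such as analysis, synthesis, and evaluation.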