On-device Training: A First Overview on Existing Systems

The recent breakthroughs in machine learning (ML) and deep learning (DL) have catalyzed the design and development of various intelligent systems over wide application domains. While most existing machine learning models require large memory and computing power, efforts have been made to deploy some models on resource-constrained devices as well. A majority of the early application systems focused on exploiting the inference capabilities of ML and DL models, where data captured from different mobile and embedded sensing components are processed through these models for application goals such as classification and segmentation. More recently, the concept of exploiting the mobile and embedded computing resources for ML/DL model training has gained attention, as such capabilities allow (i) the training of models via local data without the need to share data over wireless links, thus enabling privacy-preserving computation by design, (ii) model personalization and environment adaptation, and (ii) deployment of accurate models in remote and hardly accessible locations without stable internet connectivity. This work targets to summarize and analyze state-of-the-art systems research that allows such on-device model training capabilities and provide a survey of on-device training from a systems perspective.

翻译：近期机器学习和深度学习的突破性进展推动了各类智能系统在广泛领域中的设计与开发。尽管现有大多数机器学习模型需要大量内存和计算能力，但研究人员已努力将部分模型部署在资源受限设备上。早期应用系统主要侧重于利用机器学习和深度学习模型的推理能力，即通过移动设备及嵌入式传感组件采集的数据经模型处理后实现分类、分割等应用目标。近期，利用移动和嵌入式计算资源进行模型训练的概念逐渐受到关注，此类能力可实现：（i）通过本地数据进行模型训练，无需通过无线链路共享数据，从而在设计层面实现隐私保护计算；（ii）模型个性化与环境自适应；（iii）在无稳定互联网连接的偏远及难以到达区域部署高精度模型。本研究旨在总结与分析当前支持此类设备端训练能力的系统前沿研究，并从系统视角对设备端训练进行综述。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【开放书】设计机器学习系统，Designing Machine Learning Systems

专知会员服务

78+阅读 · 2022年5月17日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

105+阅读 · 2022年2月10日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日