Mobile video applications today have attracted significant attention. Deep learning model (e.g. deep neural network, DNN) compression is widely used to enable on-device inference for facilitating robust and private mobile video applications. The compressed DNN, however, is vulnerable to the agnostic data drift of the live video captured from the dynamically changing mobile scenarios. To combat the data drift, mobile ends rely on edge servers to continuously evolve and re-compress the DNN with freshly collected data. We design a framework, AdaEvo, that efficiently supports the resource-limited edge server handling mobile DNN evolution tasks from multiple mobile ends. The key goal of AdaEvo is to maximize the average quality of experience (QoE), e.g. the proportion of high-quality DNN service time to the entire life cycle, for all mobile ends. Specifically, it estimates the DNN accuracy drops at the mobile end without labels and performs a dedicated video frame sampling strategy to control the size of retraining data. In addition, it balances the limited computing and memory resources on the edge server and the competition between asynchronous tasks initiated by different mobile users. With an extensive evaluation of real-world videos from mobile scenarios and across four diverse mobile tasks, experimental results show that AdaEvo enables up to 34% accuracy improvement and 32% average QoE improvement.
翻译:摘要:当前移动视频应用已引起广泛关注。深度学习模型(如深度神经网络DNN)压缩技术被广泛用于实现设备端推理,以支持鲁棒且隐私保护的移动视频应用。然而,压缩后的DNN容易受到动态变化的移动场景中实时视频数据分布漂移的影响。为应对数据漂移,移动终端需依赖边缘服务器利用新采集数据持续演化并重新压缩DNN。本文设计了AdaEvo框架,高效支持资源受限的边缘服务器处理来自多个移动终端的DNN演化任务。AdaEvo的核心目标是最大化所有移动终端的平均体验质量(QoE),即高质量DNN服务时间在整个生命周期中的占比。具体而言,该框架在无标签情况下估计移动端DNN精度下降,并采用专用视频帧采样策略控制重训练数据规模。此外,它平衡了边缘服务器上有限的计算与内存资源,以及不同移动用户发起的异步任务之间的竞争。通过对移动场景真实视频及四个多样化移动任务的广泛评估,实验结果表明AdaEvo可实现高达34%的精度提升和32%的平均QoE提升。