Federated edge learning (FEEL) is a popular distributed learning framework for privacy-preserving at the edge, in which densely distributed edge devices periodically exchange model-updates with the server to complete the global model training. Due to limited bandwidth and uncertain wireless environment, FEEL may impose heavy burden to the current communication system. In addition, under the common FEEL framework, the server needs to wait for the slowest device to complete the update uploading before starting the aggregation process, leading to the straggler issue that causes prolonged communication time. In this paper, we propose to accelerate FEEL from two aspects: i.e., 1) performing data compression on the edge devices and 2) setting a deadline on the edge server to exclude the straggler devices. However, undesired gradient compression errors and transmission outage are introduced by the aforementioned operations respectively, affecting the convergence of FEEL as well. In view of these practical issues, we formulate a training time minimization problem, with the compression ratio and deadline to be optimized. To this end, an asymptotically unbiased aggregation scheme is first proposed to ensure zero optimality gap after convergence, and the impact of compression error and transmission outage on the overall training time are quantified through convergence analysis. Then, the formulated problem is solved in an alternating manner, based on which, the novel joint compression and deadline optimization (JCDO) algorithm is derived. Numerical experiments for different use cases in FEEL including image classification and autonomous driving show that the proposed method is nearly 30X faster than the vanilla FedAVG algorithm, and outperforms the state-of-the-art schemes.
翻译:联邦边缘学习(FEEL)是一种流行的分布式学习框架,旨在边缘侧保护隐私,其中密集分布的边缘设备周期性地与服务器交换模型更新以完成全局模型训练。由于带宽有限且无线环境不确定,FEEL可能对当前通信系统造成沉重负担。此外,在常见的FEEL框架下,服务器需要等待最慢的设备完成更新上传才能启动聚合过程,这导致掉队者问题,从而延长通信时间。本文提出从两个方面加速FEEL:1)在边缘设备上执行数据压缩;2)在边缘服务器设置截止时间以排除掉队设备。然而,上述操作分别引入了不利的梯度压缩误差和传输中断,同样影响FEEL的收敛性。针对这些实际问题,我们构建了一个训练时间最小化问题,需要优化压缩比和截止时间。为此,首先提出一种渐近无偏的聚合方案,确保收敛后最优性差距为零,并通过收敛性分析量化压缩误差和传输中断对总体训练时间的影响。然后,通过交替方式求解所构建的问题,并由此推导出新颖的联合压缩与截止时间优化(JCDO)算法。针对FEEL中不同用例(包括图像分类和自动驾驶)的数值实验表明,所提方法比原始FedAVG算法快近30倍,并优于现有最优方案。