Accelerating Wireless Federated Learning via Nesterov's Momentum and Distributed Principle Component Analysis

A wireless federated learning system is investigated by allowing a server and workers to exchange uncoded information via orthogonal wireless channels. Since the workers frequently upload local gradients to the server via bandwidth-limited channels, the uplink transmission from the workers to the server becomes a communication bottleneck. Therefore, a one-shot distributed principle component analysis (PCA) is leveraged to reduce the dimension of uploaded gradients such that the communication bottleneck is relieved. A PCA-based wireless federated learning (PCA-WFL) algorithm and its accelerated version (i.e., PCA-AWFL) are proposed based on the low-dimensional gradients and the Nesterov's momentum. For the non-convex loss functions, a finite-time analysis is performed to quantify the impacts of system hyper-parameters on the convergence of the PCA-WFL and PCA-AWFL algorithms. The PCA-AWFL algorithm is theoretically certified to converge faster than the PCA-WFL algorithm. Besides, the convergence rates of PCA-WFL and PCA-AWFL algorithms quantitatively reveal the linear speedup with respect to the number of workers over the vanilla gradient descent algorithm. Numerical results are used to demonstrate the improved convergence rates of the proposed PCA-WFL and PCA-AWFL algorithms over the benchmarks.

翻译：本文通过允许服务器与工作节点经由正交无线信道交换未编码信息，对无线联邦学习系统展开研究。由于工作节点需频繁通过带宽受限信道向服务器上传本地梯度，上行传输链路成为通信瓶颈。为此，利用单次分布式主成分分析（PCA）降低上传梯度的维度以缓解通信瓶颈。基于低维梯度与Nesterov动量，提出基于PCA的无线联邦学习算法（PCA-WFL）及其加速版本（PCA-AWFL）。针对非凸损失函数，通过有限时间分析量化系统超参数对PCA-WFL和PCA-AWFL算法收敛性的影响。理论证明PCA-AWFL算法收敛速度快于PCA-WFL算法。此外，PCA-WFL与PCA-AWFL算法的收敛速率定量揭示了相对于原始梯度下降算法，其收敛速度与工作节点数量呈线性加速关系。数值结果验证了所提PCA-WFL与PCA-AWFL算法相比基准方法具有更优的收敛速率。

相关内容

PCA

关注 3

在统计中，主成分分析（PCA）是一种通过最大化每个维度的方差来将较高维度空间中的数据投影到较低维度空间中的方法。给定二维，三维或更高维空间中的点集合，可以将“最佳拟合”线定义为最小化从点到线的平均平方距离的线。可以从垂直于第一条直线的方向类似地选择下一条最佳拟合线。重复此过程会产生一个正交的基础，其中数据的不同单个维度是不相关的。这些基向量称为主成分。

《机器学习的最优传输》教程，63页PPT

专知会员服务

63+阅读 · 2022年4月30日

【CVPR 2022】基于本地正则化和稀疏化差分隐私的联邦学习，Differentially Private Federated Learning with Local Regularization and Sparsification

专知会员服务

17+阅读 · 2022年3月19日

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

专知会员服务

30+阅读 · 2022年2月22日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日