In a real federated learning (FL) system, the communication overhead of passing model parameters between clients and the parameter server (PS) is often a bottleneck. Hierarchical federated learning (HFL), which places multiple edge servers (ESs) between the clients and the PS, can partially alleviate this communication pressure, but it still requires the PS to aggregate model parameters from multiple ESs. To further reduce communication overhead, we bring sequential FL (SFL) into HFL for the first time: the central PS is removed, and model training is completed solely by passing the global model between adjacent ESs in each iteration. We propose a novel algorithm tailored to this combined framework, referred to as Fed-CHS. Convergence results are derived for strongly convex and non-convex loss functions under various data heterogeneity setups, showing convergence performance comparable to that of algorithms designed for HFL or SFL alone. Experimental results demonstrate the superiority of Fed-CHS over baseline methods in both communication overhead savings and test accuracy.
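To make the data flow concrete, the following is a minimal Python sketch of the chain-style training loop the abstract describes, where the global model travels from one ES to the next instead of returning to a central PS. All names here (`local_sgd`, `edge_servers`) are hypothetical, and the plain-SGD local solver with uniform averaging is an illustrative simplification under assumed settings, not the actual Fed-CHS update rule.

```python
# Sketch of PS-free, chain-style hierarchical sequential FL (illustrative only;
# local solver, aggregation weights, and topology are assumptions, not Fed-CHS).
import numpy as np

rng = np.random.default_rng(0)

def local_sgd(model, shard, lr=0.01, steps=5):
    """Run a few local SGD steps on one client's data shard; return the updated model."""
    X, y = shard
    w = model.copy()
    for _ in range(steps):
        grad = X.T @ (X @ w - y) / len(y)  # gradient of (1/2n)*||Xw - y||^2
        w -= lr * grad
    return w

# Toy setup: 3 edge servers, each serving 2 clients with private least-squares shards.
edge_servers = [
    [(rng.normal(size=(20, 5)), rng.normal(size=20)) for _ in range(2)]
    for _ in range(3)
]

model = np.zeros(5)                       # global model; note there is no central PS
for iteration in range(10):
    for clients in edge_servers:          # the model is passed along the ES chain
        # Each ES collects updated models from its own clients in parallel ...
        local_models = [local_sgd(model, shard) for shard in clients]
        # ... aggregates them locally (uniform average here), then forwards the
        # result to the next adjacent ES rather than uploading it to a PS.
        model = np.mean(local_models, axis=0)
```

Each full pass over the ES chain thus plays the role of one global iteration, and the only inter-server traffic is a single model handoff between adjacent ESs.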