Agglomerative Federated Learning: Empowering Larger Model Training via End-Edge-Cloud Collaboration

Federated Learning (FL) enables training Artificial Intelligence (AI) models over end devices without compromising their privacy. As computing tasks are increasingly performed by a combination of cloud, edge, and end devices, FL can benefit from this End-Edge-Cloud Collaboration (EECC) paradigm to achieve collaborative device-scale expansion with real-time access. Although Hierarchical Federated Learning (HFL) supports multi-tier model aggregation suitable for EECC, prior works assume the same model structure on all computing nodes, constraining the model scale by the weakest end devices. To address this issue, we propose Agglomerative Federated Learning (FedAgg), which is a novel EECC-empowered FL framework that allows the trained models from end, edge, to cloud to grow larger in size and stronger in generalization ability. FedAgg recursively organizes computing nodes among all tiers based on Bridge Sample Based Online Distillation Protocol (BSBODP), which enables every pair of parent-child computing nodes to mutually transfer and distill knowledge extracted from generated bridge samples. This design enhances the performance by exploiting the potential of larger models, with privacy constraints of FL and flexibility requirements of EECC both satisfied. Experiments under various settings demonstrate that FedAgg outperforms state-of-the-art methods by an average of 4.53\% accuracy gains and remarkable improvements in convergence rate.

翻译：联邦学习（FL）能够在保护终端设备隐私的前提下训练人工智能（AI）模型。随着计算任务日益由云、边缘和终端设备协同完成，FL可借助这种端-边-云协作（EECC）范式实现实时接入下的协作式设备规模扩展。尽管分层联邦学习（HFL）支持适用于EECC的多层模型聚合，但现有研究均假设所有计算节点采用相同模型结构，导致模型规模受限于最弱的终端设备。针对该问题，我们提出聚合联邦学习（FedAgg）——一种新型的EECC赋能FL框架，允许从终端到边缘再到云端训练的模型在规模上逐步增大，泛化能力逐步增强。FedAgg基于桥接样本在线蒸馏协议（BSBODP），递归组织所有层级的计算节点，使每对父子计算节点能够相互迁移并蒸馏从生成的桥接样本中提取的知识。该设计在满足FL隐私约束和EECC灵活性要求的同时，通过挖掘更大模型的潜力来提升性能。多种场景下的实验表明，FedAgg相比现有最优方法平均准确率提升4.53%，收敛速度显著提高。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日