Federated Learning as a Service for Hierarchical Edge Networks with Heterogeneous Models

Federated learning (FL) is a distributed Machine Learning (ML) framework that is capable of training a new global model by aggregating clients' locally trained models without sharing users' original data. Federated learning as a service (FLaaS) offers a privacy-preserving approach for training machine learning models on devices with various computational resources. Most proposed FL-based methods train the same model in all client devices regardless of their computational resources. However, in practical Internet of Things (IoT) scenarios, IoT devices with limited computational resources may not be capable of training models that client devices with greater hardware performance hosted. Most of the existing FL frameworks that aim to solve the problem of aggregating heterogeneous models are designed for Independent and Identical Distributed (IID) data, which may make it hard to reach the target algorithm performance when encountering non-IID scenarios. To address these problems in hierarchical networks, in this paper, we propose a heterogeneous aggregation framework for hierarchical edge systems called HAF-Edge. In our proposed framework, we introduce a communication-efficient model aggregation method designed for FL systems with two-level model aggregations running at the edge and cloud levels. This approach enhances the convergence rate of the global model by leveraging selective knowledge transfer during the aggregation of heterogeneous models. To the best of our knowledge, this work is pioneering in addressing the problem of aggregating heterogeneous models within hierarchical FL systems spanning IoT, edge, and cloud environments. We conducted extensive experiments to validate the performance of our proposed method. The evaluation results demonstrate that HAF-Edge significantly outperforms state-of-the-art methods.

翻译：联邦学习（FL）是一种分布式机器学习（ML）框架，能够通过聚合客户端本地训练的模型来训练新的全局模型，而无需共享用户的原始数据。联邦学习即服务（FLaaS）提供了一种隐私保护方法，可在具有不同计算资源的设备上训练机器学习模型。大多数已提出的基于FL的方法在所有客户端设备上训练相同的模型，而忽略了其计算资源的差异。然而，在实际的物联网（IoT）场景中，计算资源有限的物联网设备可能无法训练那些硬件性能更强的客户端设备所承载的模型。现有大多数旨在解决异构模型聚合问题的FL框架都是为独立同分布（IID）数据设计的，这在遇到非IID场景时可能难以达到目标算法性能。为解决分层网络中的这些问题，本文提出了一种面向分层边缘系统的异构聚合框架，称为HAF-Edge。在我们提出的框架中，我们引入了一种通信高效的模型聚合方法，专为在边缘和云层级运行两级模型聚合的FL系统设计。该方法通过在异构模型聚合过程中利用选择性知识传递，提高了全局模型的收敛速度。据我们所知，这项工作是首次尝试解决跨越物联网、边缘和云环境的分层FL系统中异构模型聚合问题。我们进行了大量实验以验证所提方法的性能。评估结果表明，HAF-Edge显著优于现有最先进的方法。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日