Federated learning (FL) is a distributed Machine Learning (ML) framework that is capable of training a new global model by aggregating clients' locally trained models without sharing users' original data. Federated learning as a service (FLaaS) offers a privacy-preserving approach for training machine learning models on devices with various computational resources. Most proposed FL-based methods train the same model in all client devices regardless of their computational resources. However, in practical Internet of Things (IoT) scenarios, IoT devices with limited computational resources may not be capable of training models that client devices with greater hardware performance hosted. Most of the existing FL frameworks that aim to solve the problem of aggregating heterogeneous models are designed for Independent and Identical Distributed (IID) data, which may make it hard to reach the target algorithm performance when encountering non-IID scenarios. To address these problems in hierarchical networks, in this paper, we propose a heterogeneous aggregation framework for hierarchical edge systems called HAF-Edge. In our proposed framework, we introduce a communication-efficient model aggregation method designed for FL systems with two-level model aggregations running at the edge and cloud levels. This approach enhances the convergence rate of the global model by leveraging selective knowledge transfer during the aggregation of heterogeneous models. To the best of our knowledge, this work is pioneering in addressing the problem of aggregating heterogeneous models within hierarchical FL systems spanning IoT, edge, and cloud environments. We conducted extensive experiments to validate the performance of our proposed method. The evaluation results demonstrate that HAF-Edge significantly outperforms state-of-the-art methods.
翻译:联邦学习(FL)是一种分布式机器学习(ML)框架,能够通过聚合客户端本地训练的模型来训练新的全局模型,而无需共享用户的原始数据。联邦学习即服务(FLaaS)提供了一种隐私保护方法,可在具有不同计算资源的设备上训练机器学习模型。大多数已提出的基于FL的方法在所有客户端设备上训练相同的模型,而忽略了其计算资源的差异。然而,在实际的物联网(IoT)场景中,计算资源有限的物联网设备可能无法训练那些硬件性能更强的客户端设备所承载的模型。现有大多数旨在解决异构模型聚合问题的FL框架都是为独立同分布(IID)数据设计的,这在遇到非IID场景时可能难以达到目标算法性能。为解决分层网络中的这些问题,本文提出了一种面向分层边缘系统的异构聚合框架,称为HAF-Edge。在我们提出的框架中,我们引入了一种通信高效的模型聚合方法,专为在边缘和云层级运行两级模型聚合的FL系统设计。该方法通过在异构模型聚合过程中利用选择性知识传递,提高了全局模型的收敛速度。据我们所知,这项工作是首次尝试解决跨越物联网、边缘和云环境的分层FL系统中异构模型聚合问题。我们进行了大量实验以验证所提方法的性能。评估结果表明,HAF-Edge显著优于现有最先进的方法。