Federated learning (FL) systems face performance challenges in dealing with heterogeneous devices and non-identically distributed data across clients. We propose a dynamic global model aggregation method within Asynchronous Federated Learning (AFL) deployments to address these issues. Our aggregation method scores and adjusts the weighting of client model updates based on their upload frequency to accommodate differences in device capabilities. Additionally, we also immediately provide an updated global model to clients after they upload their local models to reduce idle time and improve training efficiency. We evaluate our approach within an AFL deployment consisting of 10 simulated clients with heterogeneous compute constraints and non-IID data. The simulation results, using the FashionMNIST dataset, demonstrate over 10% and 19% improvement in global model accuracy compared to state-of-the-art methods PAPAYA and FedAsync, respectively. Our dynamic aggregation method allows reliable global model training despite limiting client resources and statistical data heterogeneity. This improves robustness and scalability for real-world FL deployments.
翻译:联邦学习(FL)系统在处理异构设备及客户端间非同分布数据时面临性能挑战。我们提出一种在异步联邦学习(AFL)部署中的动态全局模型聚合方法以解决这些问题。该聚合方法根据客户端模型更新的上传频率对其贡献进行评分并调整权重,从而适应设备能力差异。此外,我们还在客户端上传本地模型后立即提供更新后的全局模型,以减少空闲时间并提升训练效率。我们在包含10个模拟客户端(具有异构计算约束和非IID数据)的AFL部署中评估该方法。基于FashionMNIST数据集的仿真结果表明,与最先进方法PAPAYA和FedAsync相比,全局模型准确率分别提升超过10%和19%。尽管客户端资源受限且存在统计异质性数据,我们的动态聚合方法仍能实现可靠的全局模型训练,从而增强真实FL部署的鲁棒性和可扩展性。