Fed-FSNet: Mitigating Non-I.I.D. Federated Learning via Fuzzy Synthesizing Network

Federated learning (FL) has recently emerged as a promising privacy-preserving distributed machine learning framework. It aims to collaboratively learn a shared global model by training locally on edge devices and aggregating the local models into a global one, without sharing raw data centrally on the cloud server. However, because local data are highly heterogeneous (Non-I.I.D.) across edge devices, FL can easily produce a global model whose gradients on local datasets are more shifted, which degrades model performance or even prevents convergence during training. In this paper, we propose a novel FL training framework, dubbed Fed-FSNet, which uses a properly designed Fuzzy Synthesizing Network (FSNet) to mitigate the Non-I.I.D. problem in FL at the source. Concretely, we maintain an edge-agnostic hidden model on the cloud server to estimate a less accurate but direction-aware inversion of the global model. The hidden model can then fuzzily synthesize several mimic I.I.D. data samples (sample features) conditioned only on the global model; these samples can be shared by edge devices to help FL training converge faster and better. Moreover, since the synthesizing process requires neither access to the parameters or updates of local models nor analysis of individual local model outputs, our framework still preserves the privacy of FL. Experimental results on several FL benchmarks demonstrate that our method significantly mitigates the Non-I.I.D. issue and outperforms other representative methods.
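To make the aggregation step concrete, here is a minimal sketch of how a server might combine local models into a global one. The abstract does not say which aggregation rule Fed-FSNet uses; this sketch assumes a FedAvg-style average weighted by local dataset size. The function name `aggregate`, the parameter layout, and the toy values are all hypothetical, and the sketch does not model FSNet or the hidden model.

```python
from typing import Dict, List

# A model is represented here as a dict of flat parameter lists (an assumption
# made for illustration; real models would use tensors).
Params = Dict[str, List[float]]


def aggregate(local_params: List[Params], num_samples: List[int]) -> Params:
    """Average local parameters, weighted by each device's sample count.

    This is a FedAvg-style rule assumed for illustration, not necessarily
    the rule used by Fed-FSNet.
    """
    if not local_params or len(local_params) != len(num_samples):
        raise ValueError("need one sample count per local model")
    total = sum(num_samples)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    global_params: Params = {}
    for name in local_params[0]:
        size = len(local_params[0][name])
        global_params[name] = [
            sum(p[name][i] * n for p, n in zip(local_params, num_samples)) / total
            for i in range(size)
        ]
    return global_params


# Two hypothetical edge devices holding 1 and 3 training samples respectively.
edge_a = {"w": [1.0, 2.0], "b": [0.0]}
edge_b = {"w": [3.0, 4.0], "b": [1.0]}
global_model = aggregate([edge_a, edge_b], [1, 3])
print(global_model)
```

Only parameters and sample counts reach the server in this sketch; raw data stays on the devices. When the local datasets are Non-I.I.D., the averaged model can fit each device's data poorly, which is the problem the abstract sets out to mitigate.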

