Efficient Semi-Supervised Federated Learning for Heterogeneous Participants

Federated Learning (FL) has emerged to allow multiple clients to collaboratively train machine learning models on their private data. However, training and deploying large models for broader applications is challenging in resource-constrained environments. Fortunately, Split Federated Learning (SFL) offers an excellent solution by alleviating the computation and communication burden on the clients SFL often assumes labeled data for local training on clients, however, it is not the case in practice.Prior works have adopted semi-supervised techniques for leveraging unlabeled data in FL, but data non-IIDness poses another challenge to ensure training efficiency. Herein, we propose Pseudo-Clustering Semi-SFL, a novel system for training models in scenarios where labeled data reside on the server. By introducing Clustering Regularization, model performance under data non-IIDness can be improved. Besides, our theoretical and experimental investigations into model convergence reveal that the inconsistent training processes on labeled and unlabeled data impact the effectiveness of clustering regularization. Upon this, we develop a control algorithm for global updating frequency adaptation, which dynamically adjusts the number of supervised training iterations to mitigate the training inconsistency. Extensive experiments on benchmark models and datasets show that our system provides a 3.3x speed-up in training time and reduces the communication cost by about 80.1% while reaching the target accuracy, and achieves up to 6.9% improvement in accuracy under non-IID scenarios compared to the state-of-the-art.

翻译：联邦学习（FL）应运而生，允许多个客户端在私有数据上协作训练机器学习模型。然而，在资源受限的环境中训练和部署大规模模型以服务于更广泛的应用仍面临挑战。幸运的是，分裂联邦学习（SFL）通过减轻客户端的计算和通信负担提供了出色的解决方案。SFL通常假设客户端拥有用于本地训练的标注数据，但这在实际中并不成立。先前的工作已在FL中采用半监督技术来利用未标注数据，但数据非独立同分布（non-IIDness）对训练效率构成了另一挑战。为此，我们提出Pseudo-Clustering Semi-SFL，一种新颖的系统，用于在标注数据驻留在服务器端的场景下训练模型。通过引入聚类正则化，可以提升数据非独立同分布下的模型性能。此外，我们对模型收敛性的理论和实验研究表明，标注数据与未标注数据上不一致的训练过程会影响聚类正则化的有效性。基于此，我们开发了一种用于全局更新频率自适应的控制算法，该算法动态调整监督训练迭代次数以缓解训练不一致性。在基准模型和数据集上的大量实验表明，我们的系统在达到目标精度的同时，训练速度提升了3.3倍，通信成本降低了约80.1%，并且在非独立同分布场景下准确率较现有最优方法提升了高达6.9%。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日