Federated Few-shot Learning

Federated Learning (FL) enables multiple clients to collaboratively learn a machine learning model without exchanging their own local data. In this way, the server can exploit the computational power of all clients and train the model on a larger set of data samples among all clients. Although such a mechanism is proven to be effective in various fields, existing works generally assume that each client preserves sufficient data for training. In practice, however, certain clients may only contain a limited number of samples (i.e., few-shot samples). For example, the available photo data taken by a specific user with a new mobile device is relatively rare. In this scenario, existing FL efforts typically encounter a significant performance drop on these clients. Therefore, it is urgent to develop a few-shot model that can generalize to clients with limited data under the FL scenario. In this paper, we refer to this novel problem as \emph{federated few-shot learning}. Nevertheless, the problem remains challenging due to two major reasons: the global data variance among clients (i.e., the difference in data distributions among clients) and the local data insufficiency in each client (i.e., the lack of adequate local data for training). To overcome these two challenges, we propose a novel federated few-shot learning framework with two separately updated models and dedicated training strategies to reduce the adverse impact of global data variance and local data insufficiency. Extensive experiments on four prevalent datasets that cover news articles and images validate the effectiveness of our framework compared with the state-of-the-art baselines. Our code is provided\footnote{\href{https://github.com/SongW-SW/F2L}{https://github.com/SongW-SW/F2L}}.

翻译：联邦学习（Federated Learning, FL）使多个客户端能够在不交换本地数据的情况下协同训练机器学习模型。通过这种方式，服务器可以充分利用所有客户端的计算能力，并在各客户端更大的数据样本集上训练模型。尽管该机制已在多个领域被证明有效，但现有研究通常假设每个客户端拥有充足的数据用于训练。然而，在实际场景中，某些客户端可能仅包含有限数量的样本（即小样本）。例如，特定用户使用新移动设备拍摄的可利用照片数据相对稀缺。在此情况下，现有联邦学习方法通常在这些客户端上遭遇显著的性能下降。因此，亟需开发一种能够在联邦场景下泛化到数据有限客户端的小样本模型。本文将此新问题定义为“联邦小样本学习”。然而，该问题仍面临两大挑战：客户端之间的全局数据差异（即各客户端数据分布的不同）以及每个客户端的本地数据不足（即缺乏足够的本地训练数据）。为克服这两个挑战，我们提出了一种新颖的联邦小样本学习框架，该框架包含两个独立更新的模型及专用训练策略，以降低全局数据差异和本地数据不足带来的负面影响。在涵盖新闻文章和图像的四个主流数据集上进行的大量实验验证了我们的框架相较于现有最先进基线的有效性。我们的代码已提供（https://github.com/SongW-SW/F2L）。

相关内容

小样本学习

关注 216

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日