Federated Few-shot Learning

Federated Learning (FL) enables multiple clients to collaboratively learn a machine learning model without exchanging their own local data. In this way, the server can exploit the computational power of all clients and train the model on a larger set of data samples among all clients. Although such a mechanism is proven to be effective in various fields, existing works generally assume that each client preserves sufficient data for training. In practice, however, certain clients may only contain a limited number of samples (i.e., few-shot samples). For example, the available photo data taken by a specific user with a new mobile device is relatively rare. In this scenario, existing FL efforts typically encounter a significant performance drop on these clients. Therefore, it is urgent to develop a few-shot model that can generalize to clients with limited data under the FL scenario. In this paper, we refer to this novel problem as federated few-shot learning. Nevertheless, the problem remains challenging due to two major reasons: the global data variance among clients (i.e., the difference in data distributions among clients) and the local data insufficiency in each client (i.e., the lack of adequate local data for training). To overcome these two challenges, we propose a novel federated few-shot learning framework with two separately updated models and dedicated training strategies to reduce the adverse impact of global data variance and local data insufficiency. Extensive experiments on four prevalent datasets that cover news articles and images validate the effectiveness of our framework compared with the state-of-the-art baselines. Our code is provided at https://github.com/SongW-SW/F2L.

翻译：联邦学习（Federated Learning, FL）使多个客户端能够在不交换本地数据的情况下协作学习机器学习模型。通过这种方式，服务器可以利用所有客户端的计算能力，并在更大规模的数据样本集上训练模型。尽管这种机制已被证明在各个领域行之有效，但现有工作通常假设每个客户端都拥有充足的数据用于训练。然而在实践中，某些客户端可能仅包含有限数量的样本（即小样本）。例如，特定用户使用新移动设备拍摄的照片数据相对稀少。在此场景下，现有联邦学习方法往往在这些客户端上出现显著的性能下降。因此，亟需开发一种能够在联邦学习场景下泛化到数据有限客户端的小样本模型。本文将此新问题称为联邦小样本学习。然而，该问题仍面临两大挑战：客户端间的全局数据差异（即各客户端数据分布差异）和每个客户端的本地数据不足（即缺乏足够的本地训练数据）。为克服这两个挑战，我们提出了一种新颖的联邦小样本学习框架，该框架包含两个独立更新的模型和专用训练策略，以降低全局数据差异和本地数据不足带来的不利影响。在涵盖新闻文章和图像的四个主流数据集上进行的广泛实验验证了我们的框架相比现有最优基线方法的有效性。我们的代码已开源在 https://github.com/SongW-SW/F2L。

相关内容

小样本学习

关注 216

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日