Federated learning is an approach to collaboratively training machine learning models for multiple parties that prohibit data sharing. One of the challenges in federated learning is non-IID data between clients, as a single model can not fit the data distribution for all clients. Meta-learning, such as Per-FedAvg, is introduced to cope with the challenge. Meta-learning learns shared initial parameters for all clients. Each client employs gradient descent to adapt the initialization to local data distributions quickly to realize model personalization. However, due to non-convex loss function and randomness of sampling update, meta-learning approaches have unstable goals in local adaptation for the same client. This fluctuation in different adaptation directions hinders the convergence in meta-learning. To overcome this challenge, we use the historical local adapted model to restrict the direction of the inner loop and propose an elastic-constrained method. As a result, the current round inner loop keeps historical goals and adapts to better solutions. Experiments show our method boosts meta-learning convergence and improves personalization without additional calculation and communication. Our method achieved SOTA on all metrics in three public datasets.
翻译:联邦学习是一种多方协作训练机器学习模型的方法,其禁止数据共享。联邦学习面临的挑战之一是客户端间的数据非独立同分布,因为单一模型无法适配所有客户端的数据分布。为此,引入了元学习方法(如Per-FedAvg)应对该挑战。元学习为所有客户端学习共享的初始参数,每个客户端通过梯度下降将初始参数快速适配至本地数据分布,实现模型个性化。然而,由于非凸损失函数和采样更新的随机性,元学习方法在相同客户端的本地适配中存在目标不稳定性。这种不同适配方向的波动阻碍了元学习的收敛。为解决此问题,我们利用历史本地适配模型约束内循环方向,提出弹性约束方法。这使得当前轮次的内循环既能保留历史目标,又能适配更优解。实验表明,该方法无需额外计算和通信即可提升元学习收敛性并增强个性化效果。我们在三个公开数据集的所有指标上均达到了最优性能。