Federated Learning (FL) is a distributed machine learning approach where multiple clients work together to solve a machine learning task. One of the key challenges in FL is the issue of partial participation, which occurs when a large number of clients are involved in the training process. The traditional method to address this problem is randomly selecting a subset of clients at each communication round. In our research, we propose a new technique and design a novel regularized client participation scheme. Under this scheme, each client joins the learning process every $R$ communication rounds, which we refer to as a meta epoch. We have found that this participation scheme leads to a reduction in the variance caused by client sampling. Combined with the popular FedAvg algorithm (McMahan et al., 2017), it results in superior rates under standard assumptions. For instance, the optimization term in our main convergence bound decreases linearly with the product of the number of communication rounds and the size of the local dataset of each client, and the statistical term scales with step size quadratically instead of linearly (the case for client sampling with replacement), leading to better convergence rate $\mathcal{O}\left(\frac{1}{T^2}\right)$ compared to $\mathcal{O}\left(\frac{1}{T}\right)$, where $T$ is the total number of communication rounds. Furthermore, our results permit arbitrary client availability as long as each client is available for training once per each meta epoch.
翻译:联邦学习(Federated Learning, FL)是一种分布式机器学习方法,其中多个客户端协同完成机器学习任务。FL的关键挑战之一是部分参与问题,即当大量客户端参与训练过程时出现的问题。解决该问题的传统方法是在每轮通信中随机选择客户端子集。在本研究中,我们提出了一种新技术并设计了一种新颖的正则化客户端参与方案。在此方案下,每个客户端每$R$轮通信参与一次学习过程,我们将这$R$轮定义为一个元周期。我们发现该参与方案能够降低由客户端采样引起的方差。结合流行的FedAvg算法(McMahan等人,2017),该方案在标准假设下实现了更优的收敛速率。例如,我们主要收敛界中的优化项随通信轮次与每个客户端本地数据集大小的乘积线性递减,而统计项以步长的二次方(而非线性,后者对应于有放回客户端采样)缩放,从而获得比$\mathcal{O}\left(\frac{1}{T}\right)$更好的收敛速率$\mathcal{O}\left(\frac{1}{T^2}\right)$,其中$T$为总通信轮次。此外,只要每个客户端在每个元周期内至少参与一次训练,我们的结果允许任意客户端可用性。