Differential Privacy as a Perk: Federated Learning over Multiple-Access Fading Channels with a Multi-Antenna Base Station

Federated Learning (FL) is a distributed learning paradigm that preserves privacy by eliminating the need to exchange raw data during training. In its prototypical edge instantiation with underlying wireless transmissions enabled by analog over-the-air computing (AirComp), referred to as \emph{over-the-air FL (AirFL)}, the inherent channel noise plays a unique role of \emph{frenemy} in the sense that it degrades training due to noisy global aggregation while providing a natural source of randomness for privacy-preserving mechanisms, formally quantified by \emph{differential privacy (DP)}. It remains, nevertheless, challenging to effectively harness such channel impairments, as prior arts, under assumptions of either simple channel models or restricted types of loss functions, mostly considering (local) DP enhancement with a single-round or non-convergent bound on privacy loss. In this paper, we study AirFL over multiple-access fading channels with a multi-antenna base station (BS) subject to user-level DP requirements. Despite a recent study, which claimed in similar settings that artificial noise (AN) must be injected to ensure DP in general, we demonstrate, on the contrary, that DP can be gained as a \emph{perk} even \emph{without} employing any AN. Specifically, we derive a novel bound on DP that converges under general bounded-domain assumptions on model parameters, along with a convergence bound with general smooth and non-convex loss functions. Next, we optimize over receive beamforming and power allocations to characterize the optimal convergence-privacy trade-offs, which also reveal explicit conditions in which DP is achievable without compromising training. Finally, our theoretical findings are validated by extensive numerical results.

翻译：联邦学习（FL）是一种分布式学习范式，通过在训练过程中无需交换原始数据来保护隐私。在其典型的边缘实现中，通过模拟空中计算（AirComp）实现底层无线传输，称为空中联邦学习（AirFL），固有信道噪声扮演着独特的朋友兼敌人角色：一方面因噪声全局聚合而降低训练质量，另一方面为隐私保护机制提供了天然的随机性来源，这可由差分隐私（DP）进行形式化量化。然而，有效利用此类信道损伤仍具挑战性，因为现有技术在简单信道模型或受限损失函数类型的假设下，大多仅考虑（局部）DP增强，且仅提供单轮或非收敛的隐私损失界限。本文研究具有用户级DP要求的多天线基站（BS）下多址衰落信道上的AirFL。尽管近期研究在类似设置中声称通常必须注入人工噪声（AN）以确保DP，但我们证明，即使不采用任何AN，DP仍可作为额外收益获得。具体而言，我们推导出在模型参数一般有界域假设下收敛的新型DP界限，以及具有一般光滑非凸损失函数的收敛界限。接着，我们通过优化接收波束成形和功率分配来刻画最优的收敛-隐私权衡关系，并揭示在不影响训练前提下实现DP的显式条件。最后，我们通过大量数值结果验证了理论发现。