FedRand: Enhancing Privacy in Federated Learning with Randomized LoRA Subparameter Updates

Federated Learning (FL) is a widely used framework for training models in a decentralized manner, ensuring that the central server does not have direct access to data from local clients. However, this approach may still fail to fully preserve data privacy, as models from local clients are exposed to the central server during the aggregation process. This issue becomes even more critical when training vision-language models (VLMs) with FL, as VLMs can easily memorize training data instances, making them vulnerable to membership inference attacks (MIAs). To address this challenge, we propose the FedRand framework, which avoids disclosing the full set of client parameters. In this framework, each client randomly selects subparameters of Low-Rank Adaptation (LoRA) from the server and keeps the remaining counterparts of the LoRA weights as private parameters. After training both parameters on the client's private dataset, only the non-private client parameters are sent back to the server for aggregation. This approach mitigates the risk of exposing client-side VLM parameters, thereby enhancing data privacy. We empirically validate that FedRand improves robustness against MIAs compared to relevant baselines while achieving accuracy comparable to methods that communicate full LoRA parameters across several benchmark datasets.

翻译：联邦学习（FL）是一种广泛使用的去中心化模型训练框架，确保中央服务器无法直接访问本地客户端的数据。然而，这种方法仍可能无法完全保护数据隐私，因为在聚合过程中，来自本地客户端的模型会暴露给中央服务器。当使用FL训练视觉语言模型（VLM）时，这一问题变得尤为关键，因为VLM容易记忆训练数据实例，使其易受成员推理攻击（MIA）的影响。为应对这一挑战，我们提出了FedRand框架，该框架避免披露客户端的完整参数集。在此框架中，每个客户端从服务器随机选择低秩适应（LoRA）的子参数，并将LoRA权重的其余对应部分保留为私有参数。在客户端的私有数据集上训练这两类参数后，仅将非私有的客户端参数发送回服务器进行聚合。这种方法降低了暴露客户端VLM参数的风险，从而增强了数据隐私。我们通过实验验证，与相关基线方法相比，FedRand在多个基准数据集上实现了与传输完整LoRA参数方法相当的准确性的同时，提升了对MIA的鲁棒性。

相关内容

服务器

关注 14

服务器，也称伺服器，是提供计算服务的设备。由于服务器需要响应服务请求，并进行处理，因此一般来说服务器应具备承担服务并且保障服务的能力。
服务器的构成包括处理器、硬盘、内存、系统总线等，和通用的计算机架构类似，但是由于需要提供高可靠的服务，因此在处理能力、稳定性、可靠性、安全性、可扩展性、可管理性等方面要求较高。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日