ACE: A Model Poisoning Attack on Contribution Evaluation Methods in Federated Learning

In Federated Learning (FL), a set of clients collaboratively train a machine learning model (called global model) without sharing their local training data. The local training data of clients is typically non-i.i.d. and heterogeneous, resulting in varying contributions from individual clients to the final performance of the global model. In response, many contribution evaluation methods were proposed, where the server could evaluate the contribution made by each client and incentivize the high-contributing clients to sustain their long-term participation in FL. Existing studies mainly focus on developing new metrics or algorithms to better measure the contribution of each client. However, the security of contribution evaluation methods of FL operating in adversarial environments is largely unexplored. In this paper, we propose the first model poisoning attack on contribution evaluation methods in FL, termed ACE. Specifically, we show that any malicious client utilizing ACE could manipulate the parameters of its local model such that it is evaluated to have a high contribution by the server, even when its local training data is indeed of low quality. We perform both theoretical analysis and empirical evaluations of ACE. Theoretically, we show our design of ACE can effectively boost the malicious client's perceived contribution when the server employs the widely-used cosine distance metric to measure contribution. Empirically, our results show ACE effectively and efficiently deceive five state-of-the-art contribution evaluation methods. In addition, ACE preserves the accuracy of the final global models on testing inputs. We also explore six countermeasures to defend ACE. Our results show they are inadequate to thwart ACE, highlighting the urgent need for new defenses to safeguard the contribution evaluation methods in FL.

翻译：在联邦学习（FL）中，一组客户端在不共享其本地训练数据的情况下协作训练一个机器学习模型（称为全局模型）。客户端的本地训练数据通常是非独立同分布且异质的，这导致各个客户端对全局模型最终性能的贡献存在差异。为此，研究者提出了许多贡献评估方法，使得服务器能够评估每个客户端所做的贡献，并激励高贡献客户端维持其长期参与联邦学习。现有研究主要集中于开发新的度量标准或算法以更好地衡量每个客户端的贡献。然而，在对抗性环境中运行的联邦学习贡献评估方法的安全性在很大程度上尚未得到探索。本文中，我们提出了首个针对联邦学习贡献评估方法的模型投毒攻击，称为ACE。具体而言，我们证明任何恶意客户端利用ACE均可操纵其本地模型的参数，使得服务器评估其具有高贡献，即使其本地训练数据实际上质量低下。我们对ACE进行了理论分析和实证评估。理论上，我们证明了当服务器采用广泛使用的余弦距离度量来衡量贡献时，我们的ACE设计能有效提升恶意客户端的感知贡献。实证结果表明，ACE能有效且高效地欺骗五种最先进的贡献评估方法。此外，ACE能保持最终全局模型在测试输入上的准确性。我们还探讨了六种防御ACE的对策。结果表明这些对策不足以阻止ACE，这凸显了迫切需要新的防御机制来保障联邦学习中的贡献评估方法。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日