CrowdGuard: Federated Backdoor Detection in Federated Learning

from arxiv, To appear in the Network and Distributed System Security (NDSS) Symposium 2024. Phillip Rieger and Torsten Krau{\ss} contributed equally to this contribution. 19 pages, 8 figures, 5 tables, 4 algorithms, 5 equations

Federated Learning (FL) is a promising approach enabling multiple clients to train Deep Neural Networks (DNNs) collaboratively without sharing their local training data. However, FL is susceptible to backdoor (or targeted poisoning) attacks. These attacks are initiated by malicious clients who seek to compromise the learning process by introducing specific behaviors into the learned model that can be triggered by carefully crafted inputs. Existing FL safeguards have various limitations: They are restricted to specific data distributions or reduce the global model accuracy due to excluding benign models or adding noise, are vulnerable to adaptive defense-aware adversaries, or require the server to access local models, allowing data inference attacks. This paper presents a novel defense mechanism, CrowdGuard, that effectively mitigates backdoor attacks in FL and overcomes the deficiencies of existing techniques. It leverages clients' feedback on individual models, analyzes the behavior of neurons in hidden layers, and eliminates poisoned models through an iterative pruning scheme. CrowdGuard employs a server-located stacked clustering scheme to enhance its resilience to rogue client feedback. The evaluation results demonstrate that CrowdGuard achieves a 100% True-Positive-Rate and True-Negative-Rate across various scenarios, including IID and non-IID data distributions. Additionally, CrowdGuard withstands adaptive adversaries while preserving the original performance of protected models. To ensure confidentiality, CrowdGuard uses a secure and privacy-preserving architecture leveraging Trusted Execution Environments (TEEs) on both client and server sides.

翻译：联邦学习（FL）是一种有前景的方法，允许多个客户端在不共享本地训练数据的情况下协同训练深度神经网络（DNN）。然而，FL易受后门（或定向投毒）攻击。这些攻击由恶意客户端发起，旨在通过在所学模型中引入可由精心构造的输入触发的特定行为，来破坏学习过程。现有的FL防护措施存在各种局限性：它们局限于特定的数据分布，或因排除良性模型或添加噪声而降低全局模型精度，易受自适应防御感知的对手攻击，或需服务器访问本地模型从而引发数据推断攻击。本文提出一种新型防御机制CrowdGuard，它能有效缓解FL中的后门攻击，并克服现有技术的缺陷。该方法利用客户端对单个模型的反馈，分析隐藏层中神经元的行为，并通过迭代剪枝方案消除中毒模型。CrowdGuard采用基于服务器的堆叠聚类方案，以增强其对恶意客户端反馈的鲁棒性。评估结果表明，CrowdGuard在各种场景（包括IID和非IID数据分布）下均能实现100%的真阳性率和真阴性率。此外，CrowdGuard能抵御自适应对手，同时保持受保护模型的原始性能。为确保机密性，CrowdGuard采用安全且隐私保护的架构，利用客户端和服务器两侧的可信执行环境（TEE）。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日