Federated Learning (FL) enables collaborative model training across distributed devices while preserving local data privacy, making it well suited to mobile and embedded systems. However, the decentralized nature of FL also exposes it to model poisoning attacks, particularly backdoor attacks, in which adversaries implant trigger patterns to manipulate model predictions. In this paper, we propose DeTrigger, a scalable and efficient backdoor-robust federated learning framework that leverages insights from adversarial attack methodologies. By employing gradient analysis with temperature scaling, DeTrigger detects and isolates backdoor triggers, enabling precise pruning of the model weights responsible for backdoor activations without sacrificing benign model knowledge. Extensive evaluations across four widely used datasets demonstrate that DeTrigger achieves up to 251x faster detection than traditional methods and mitigates backdoor attacks by up to 98.9%, with minimal impact on global model accuracy. Our findings establish DeTrigger as a robust and scalable solution for protecting federated learning environments against sophisticated backdoor threats.
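To make the core mechanism concrete, the sketch below illustrates one way temperature-scaled gradient analysis could surface trigger-like input regions. This is a minimal PyTorch sketch under our own assumptions, not DeTrigger's actual implementation: the function names (`trigger_saliency`, `looks_triggered`), the temperature value, and the saliency-concentration heuristic are all hypothetical.

```python
# Hypothetical sketch: temperature-scaled gradient analysis for trigger
# localization. Names, temperature, and thresholds are illustrative
# assumptions, not the paper's actual method.
import torch
import torch.nn.functional as F

def trigger_saliency(model, x, temperature=10.0):
    """Return per-pixel input-gradient magnitudes under a temperature-scaled softmax.

    Intuition: a small backdoor trigger patch tends to produce sharp,
    localized gradient peaks, whereas benign inputs yield smoother maps.
    """
    x = x.clone().detach().requires_grad_(True)
    logits = model(x)                                # raw class scores
    probs = F.softmax(logits / temperature, dim=1)   # temperature scaling softens the distribution
    # Gradient of the top predicted class probability w.r.t. the input
    top_prob = probs.max(dim=1).values.sum()
    top_prob.backward()
    return x.grad.abs().sum(dim=1)                   # aggregate over channels -> (N, H, W)

def looks_triggered(saliency, concentration_threshold=0.5):
    """Hypothetical heuristic: flag inputs whose saliency mass is unusually
    concentrated in the top ~1% of pixels."""
    flat = saliency.flatten(1)
    k = max(1, flat.shape[1] // 100)
    top_k = flat.topk(k, dim=1).values.sum(dim=1)
    return (top_k / flat.sum(dim=1)) > concentration_threshold
```

One plausible motivation for the temperature term: backdoored inputs are often classified with near-saturated confidence, which leaves little gradient signal through a standard softmax; dividing the logits by a temperature above 1 flattens the distribution and restores usable input gradients for localizing the trigger.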