DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks

Adversarial attacks, particularly patch attacks, pose significant threats to the robustness and reliability of deep learning models. Developing reliable defenses against patch attacks is crucial for real-world applications, yet current research in this area is unsatisfactory. In this paper, we propose DIFFender, a novel defense method that leverages a text-guided diffusion model to defend against adversarial patches. DIFFender includes two main stages: patch localization and patch restoration. In the localization stage, we find and exploit an intriguing property of the diffusion model to precisely identify the locations of adversarial patches. In the restoration stage, we employ the diffusion model to reconstruct the adversarial regions in the images while preserving the integrity of the visual content. Thanks to the former finding, these two stages can be simultaneously guided by a unified diffusion model. Thus, we can utilize the close interaction between them to improve the whole defense performance. Moreover, we propose a few-shot prompt-tuning algorithm to fine-tune the diffusion model, enabling the pre-trained diffusion model to adapt to the defense task easily. We conduct extensive experiments on image classification, face recognition, and further in the physical world, demonstrating that our proposed method exhibits superior robustness under strong adaptive attacks and generalizes well across various scenarios, diverse classifiers, and multiple patch attack methods.

翻译：对抗性攻击，特别是补丁攻击，对深度学习模型的鲁棒性和可靠性构成严重威胁。开发针对补丁攻击的可靠防御方法对于实际应用至关重要，然而当前该领域的研究仍不尽人意。本文提出DIFFender，一种利用文本引导扩散模型防御对抗性补丁的新型防御方法。DIFFender包含两个主要阶段：补丁定位与补丁恢复。在定位阶段，我们发现并利用扩散模型的一项有趣特性，精准识别对抗性补丁的位置；在恢复阶段，我们采用扩散模型重建图像中的对抗区域，同时保持视觉内容的完整性。得益于前一项发现，这两个阶段可由统一的扩散模型协同引导，从而通过二者的紧密交互提升整体防御性能。此外，我们提出一种少样本提示调优算法对扩散模型进行微调，使预训练扩散模型能够轻松适应防御任务。我们在图像分类、人脸识别以及物理世界场景中进行了广泛实验，结果表明，所提方法在强自适应攻击下展现出卓越的鲁棒性，并能在多种场景、不同分类器及多种补丁攻击方法中实现良好泛化。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日