Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models

Deep neural networks (DNNs) have achieved tremendous success in many remote sensing (RS) applications, in which DNNs are vulnerable to adversarial perturbations. Unfortunately, current adversarial defense approaches in RS studies usually suffer from performance fluctuation and unnecessary re-training costs due to the need for prior knowledge of the adversarial perturbations among RS data. To circumvent these challenges, we propose a universal adversarial defense approach in RS imagery (UAD-RS) using pre-trained diffusion models to defend the common DNNs against multiple unknown adversarial attacks. Specifically, the generative diffusion models are first pre-trained on different RS datasets to learn generalized representations in various data domains. After that, a universal adversarial purification framework is developed using the forward and reverse process of the pre-trained diffusion models to purify the perturbations from adversarial samples. Furthermore, an adaptive noise level selection (ANLS) mechanism is built to capture the optimal noise level of the diffusion model that can achieve the best purification results closest to the clean samples according to their Frechet Inception Distance (FID) in deep feature space. As a result, only a single pre-trained diffusion model is needed for the universal purification of adversarial samples on each dataset, which significantly alleviates the re-training efforts and maintains high performance without prior knowledge of the adversarial perturbations. Experiments on four heterogeneous RS datasets regarding scene classification and semantic segmentation verify that UAD-RS outperforms state-of-the-art adversarial purification approaches with a universal defense against seven commonly existing adversarial perturbations. Codes and the pre-trained models are available online (https://github.com/EricYu97/UAD-RS).

翻译：深度神经网络（DNN）在众多遥感（RS）应用中取得了巨大成功，但DNN易受对抗性扰动影响。当前遥感研究中的对抗防御方法通常因需要对遥感数据中对抗性扰动的先验知识而面临性能波动和不必要的重训练成本。为解决这些挑战，我们提出了一种遥感影像通用对抗防御方法（UAD-RS），利用预训练扩散模型保护通用DNN免受多种未知对抗攻击。具体而言，首先在不同遥感数据集上预训练生成式扩散模型，以学习各数据域中的泛化表示。随后，利用预训练扩散模型的前向与逆向过程构建通用对抗净化框架，以消除对抗样本中的扰动。此外，建立自适应噪声水平选择（ANLS）机制，根据深度特征空间中的Frechet初始距离（FID）捕捉扩散模型的最优噪声水平，从而获得最接近干净样本的最佳净化结果。因此，每个数据集仅需单个预训练扩散模型即可实现对抗样本的通用净化，显著减轻重训练负担，并在无需对抗扰动先验知识的情况下保持高性能。在四个异构遥感数据集上的场景分类与语义分割实验验证了UAD-RS在通用防御七种常见对抗扰动方面优于现有最先进的对抗净化方法。代码与预训练模型已在线公开（https://github.com/EricYu97/UAD-RS）。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/