Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial Defense

Masked Image Modeling (MIM) has been a prevailing framework for self-supervised visual representation learning. Within the pretraining-finetuning paradigm, the MIM framework trains an encoder by reconstructing masked image patches with the help of a decoder which would be abandoned when the encoder is used for finetuning. Despite its state-of-the-art performance on clean images, MIM models are vulnerable to adversarial attacks, limiting its real-world application, and few studies have focused on this issue. In this paper, we have discovered that noisy image modeling (NIM), a variant of MIM that uses denoising as the pre-text task, provides not only good pretrained visual features, but also effective adversarial defense for downstream models. To achieve a better accuracy-robustness trade-off, we further propose to sample the hyperparameter that controls the reconstruction difficulty from random distributions instead of setting it globally, and fine-tune downstream networks with denoised images. Experimental results demonstrate that our pre-trained denoising autoencoders are effective against different white-box, gray-box, and black-box attacks without being trained with adversarial images, while not harming the clean accuracy of fine-tuned models. Source code and models will be made available.

翻译：掩码图像建模（Masked Image Modeling, MIM）已成为自监督视觉表示学习的主流框架。在预训练-微调范式中，MIM框架通过借助解码器重建被掩码的图像块来训练编码器，而该解码器在编码器用于微调时会被丢弃。尽管MIM模型在干净图像上取得了最先进性能，但其易受对抗攻击，限制了实际应用，而很少有研究关注这一问题。本文发现，噪声图像建模（Noisy Image Modeling, NIM）——一种以去噪为前置任务的MIM变体——不仅提供良好的预训练视觉特征，还能为下游模型提供有效的对抗防御。为了实现更好的准确率-鲁棒性权衡，我们进一步提出从随机分布中采样控制重建难度的超参数，而非全局设定，并使用去噪图像对下游网络进行微调。实验结果表明，我们预训练的去噪自编码器无需使用对抗图像训练，即可有效抵御多种白盒、灰盒及黑盒攻击，同时不损害微调模型在干净图像上的准确率。源代码和模型将公开提供。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/