SCA: Highly Efficient Semantic-Consistent Unrestricted Adversarial Attack

Unrestricted adversarial attacks typically manipulate the semantic content of an image (e.g., color or texture) to create adversarial examples that are both effective and photorealistic. Recent works have utilized the diffusion inversion process to map images into a latent space, where high-level semantics are manipulated by introducing perturbations. However, they often results in substantial semantic distortions in the denoised output and suffers from low efficiency. In this study, we propose a novel framework called Semantic-Consistent Unrestricted Adversarial Attacks (SCA), which employs an inversion method to extract edit-friendly noise maps and utilizes Multimodal Large Language Model (MLLM) to provide semantic guidance throughout the process. Under the condition of rich semantic information provided by MLLM, we perform the DDPM denoising process of each step using a series of edit-friendly noise maps, and leverage DPM Solver++ to accelerate this process, enabling efficient sampling with semantic consistency. Compared to existing methods, our framework enables the efficient generation of adversarial examples that exhibit minimal discernible semantic changes. Consequently, we for the first time introduce Semantic-Consistent Adversarial Examples (SCAE). Extensive experiments and visualizations have demonstrated the high efficiency of SCA, particularly in being on average 12 times faster than the state-of-the-art attacks. Our code can be found at https://github.com/Pan-Zihao/SCA.

翻译：无限制对抗攻击通常通过操纵图像的语义内容（如颜色或纹理）来创建既有效又逼真的对抗样本。近期研究利用扩散反演过程将图像映射到潜在空间，并通过引入扰动来操纵高级语义。然而，这些方法往往导致去噪输出产生显著的语义失真，且效率低下。在本研究中，我们提出了一种名为语义一致无限制对抗攻击（SCA）的新框架，该框架采用反演方法提取易于编辑的噪声图，并利用多模态大语言模型（MLLM）在整个过程中提供语义指导。在MLLM提供丰富语义信息的条件下，我们使用一系列易于编辑的噪声图执行每一步的DDPM去噪过程，并利用DPM Solver++加速该过程，从而实现具有语义一致性的高效采样。与现有方法相比，我们的框架能够高效生成仅表现出极小可察觉语义变化的对抗样本。因此，我们首次引入了语义一致对抗样本（SCAE）。大量实验和可视化结果证明了SCA的高效性，其平均速度比最先进的攻击方法快12倍。我们的代码可在https://github.com/Pan-Zihao/SCA获取。

相关内容

SCA

关注 0

计算机动画研讨会（SCA）是计算机动画的理论和实践创新的主要论坛。SCA将从事基于时间现象的各个方面的学术研究人员和行业研究人员以及从业人员汇聚在一起，提供了一个私密的环境，鼓励社区互动，促进研究成果的交流，激发未来的想法并建立新的合作关系。官网地址：http://dblp.uni-trier.de/db/conf/sca/index.html

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日