Attack-SAM: Towards Attacking Segment Anything Model With Adversarial Examples

Segment Anything Model (SAM) has attracted significant attention recently, due to its impressive performance on various downstream tasks in a zero-short manner. Computer vision (CV) area might follow the natural language processing (NLP) area to embark on a path from task-specific vision models toward foundation models. However, deep vision models are widely recognized as vulnerable to adversarial examples, which fool the model to make wrong predictions with imperceptible perturbation. Such vulnerability to adversarial attacks causes serious concerns when applying deep models to security-sensitive applications. Therefore, it is critical to know whether the vision foundation model SAM can also be fooled by adversarial attacks. To the best of our knowledge, our work is the first of its kind to conduct a comprehensive investigation on how to attack SAM with adversarial examples. With the basic attack goal set to mask removal, we investigate the adversarial robustness of SAM in the full white-box setting and transfer-based black-box settings. Beyond the basic goal of mask removal, we further investigate and find that it is possible to generate any desired mask by the adversarial attack.

翻译：分割一切模型（SAM）凭借其在多种下游任务中零样本方式下的卓越性能，近期引起了广泛关注。计算机视觉领域可能正追随自然语言处理领域的步伐，从任务特定视觉模型迈向基础模型。然而，深度视觉模型被广泛认为易受对抗样本攻击，这类攻击通过难以察觉的扰动使模型做出错误预测。这种对对抗攻击的脆弱性在将深度模型应用于安全敏感型场景时引发了严重担忧。因此，探究视觉基础模型SAM是否也会被对抗样本攻击所欺骗至关重要。据我们所知，本研究首次系统性地探究了如何利用对抗样本攻击SAM。以基本攻击目标设定为掩码移除，我们在完全白盒设置和基于迁移的黑盒设置下考察了SAM的对抗鲁棒性。在基本掩码移除目标之外，我们进一步发现通过对抗攻击生成任意目标掩码是可行的。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【USC-Aaron Chan博士答辩Slides】可信自然语言处理机器解释的生成与利用, 242页ppt，Generating and Utilizing Machine Explanations for Trustworthy NLP

专知会员服务

16+阅读 · 2022年3月13日

【如何做研究】How to research ，22页ppt

专知会员服务

114+阅读 · 2021年4月17日

近期必读的六篇AAAI 2021【对抗攻击（Adversarial Attack）】相关论文和代码

专知会员服务

55+阅读 · 2021年2月17日