Segment (Almost) Nothing: Prompt-Agnostic Adversarial Attacks on Segmentation Models

General purpose segmentation models are able to generate (semantic) segmentation masks from a variety of prompts, including visual (points, boxed, etc.) and textual (object names) ones. In particular, input images are pre-processed by an image encoder to obtain embedding vectors which are later used for mask predictions. Existing adversarial attacks target the end-to-end tasks, i.e. aim at altering the segmentation mask predicted for a specific image-prompt pair. However, this requires running an individual attack for each new prompt for the same image. We propose instead to generate prompt-agnostic adversarial attacks by maximizing the $\ell_2$-distance, in the latent space, between the embedding of the original and perturbed images. Since the encoding process only depends on the image, distorted image representations will cause perturbations in the segmentation masks for a variety of prompts. We show that even imperceptible $\ell_\infty$-bounded perturbations of radius $\epsilon=1/255$ are often sufficient to drastically modify the masks predicted with point, box and text prompts by recently proposed foundation models for segmentation. Moreover, we explore the possibility of creating universal, i.e. non image-specific, attacks which can be readily applied to any input without further computational cost.

翻译：通用分割模型能够从多种提示（包括视觉提示（点、框等）和文本提示（对象名称））生成（语义）分割掩码。具体而言，输入图像通过图像编码器进行预处理以获得嵌入向量，这些向量随后用于掩码预测。现有的对抗攻击针对端到端任务，即旨在改变特定图像-提示对所预测的分割掩码。然而，这需要对同一图像新的提示执行单独的攻击。我们提出通过最大化潜在空间中原始图像与扰动图像嵌入之间的 $\ell_2$ 距离来生成提示无关的对抗攻击。由于编码过程仅依赖于图像，扭曲的图像表示将导致多种提示下分割掩码的扰动。我们证明，即使是半径为 $\epsilon=1/255$ 的不可感知的 $\ell_\infty$ 有界扰动，也通常足以显著修改最近提出的分割基础模型在用点、框和文本提示预测的掩码。此外，我们探索了创建通用（即非图像特定）攻击的可能性，这种攻击可以随时应用于任何输入而无需额外计算成本。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日