Intent obfuscation is a common tactic in adversarial situations, enabling the attacker both to manipulate the target system and to avoid culpability. Surprisingly, it has rarely been implemented in adversarial attacks on machine learning systems. We are the first to propose using intent obfuscation to generate adversarial examples for object detectors: by perturbing another, non-overlapping object to disrupt detection of the target object, the attacker hides their intended target. We conduct a randomized experiment on 5 prominent detectors -- YOLOv3, SSD, RetinaNet, Faster R-CNN, and Cascade R-CNN -- using both targeted and untargeted attacks, and succeed on all models and attack types. We analyze the success factors characterizing intent-obfuscating attacks, including target object confidence and perturb object size. We then demonstrate that the attacker can exploit these success factors to increase success rates across all models and attacks. Finally, we discuss the main takeaways and legal repercussions.
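To make the mechanism concrete, the untargeted variant can be sketched as a masked iterative gradient attack: the perturbation is confined to the region of the non-overlapping "perturb" object, while the loss is the detector's confidence on the separate target object. The sketch below is a minimal illustration only, assuming a differentiable detector exposed through a hypothetical interface `detector(image, box)` that returns a scalar confidence for the given box; it is not the paper's implementation.

```python
import torch

def intent_obfuscating_attack(image, detector, target_box, perturb_mask,
                              epsilon=8 / 255, alpha=2 / 255, steps=50):
    """Hypothetical sketch of an untargeted intent-obfuscating attack.

    Pixels are perturbed ONLY inside `perturb_mask` (the non-overlapping
    perturb object) to suppress the detector's confidence on `target_box`.
    `detector(image, box) -> scalar score` is an assumed interface.
    """
    delta = torch.zeros_like(image, requires_grad=True)
    for _ in range(steps):
        # Confidence the detector assigns to the target object, computed
        # on the image with the masked perturbation applied.
        score = detector(image + delta * perturb_mask, target_box)
        score.backward()
        with torch.no_grad():
            # Gradient *descent* on the target score: drive confidence down.
            delta -= alpha * delta.grad.sign()
            delta.clamp_(-epsilon, epsilon)  # enforce the L-inf budget
            delta.grad.zero_()
    return (image + delta * perturb_mask).clamp(0, 1)
```

A targeted variant would instead ascend the score of an attacker-chosen wrong label on the target box; in either case, restricting the perturbation to the mask of another object is what hides the intended target.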