CosPGD: a unified white-box adversarial attack for pixel-wise prediction tasks

While neural networks allow highly accurate predictions in many tasks, their lack of robustness towards even slight input perturbations hampers their deployment in many real-world applications. Recent research towards evaluating the robustness of neural networks such as the seminal projected gradient descent(PGD) attack and subsequent works have drawn significant attention, as they provide an effective insight into the quality of representations learned by the network. However, these methods predominantly focus on image classification tasks, while only a few approaches specifically address the analysis of pixel-wise prediction tasks such as semantic segmentation, optical flow, disparity estimation, and others, respectively. Thus, there is a lack of a unified adversarial robustness benchmarking tool(algorithm) that is applicable to all such pixel-wise prediction tasks. In this work, we close this gap and propose CosPGD, a novel white-box adversarial attack that allows optimizing dedicated attacks for any pixel-wise prediction task in a unified setting. It leverages the cosine similarity between the distributions over the predictions and ground truth (or target) to extend directly from classification tasks to regression settings. We outperform the SotA on semantic segmentation attacks in our experiments on PASCAL VOC2012 and CityScapes. Further, we set a new benchmark for adversarial attacks on optical flow, and image restoration displaying the ability to extend to any pixel-wise prediction task.

翻译：尽管神经网络在许多任务中能够实现高精度预测，但其对微小输入扰动的鲁棒性不足，阻碍了其在众多实际应用中的部署。近年来，诸如经典投影梯度下降攻击（PGD）及其后续工作等评估神经网络鲁棒性的研究引起了广泛关注，因为它们为网络学习到的表示质量提供了有效见解。然而，这些方法主要聚焦于图像分类任务，仅有少数方法专门针对语义分割、光流估计、视差估计等像素级预测任务进行分析。因此，目前缺乏一种适用于所有此类像素级预测任务的统一对抗鲁棒性基准测试工具（算法）。在本工作中，我们弥补了这一空白，提出了CosPGD——一种新颖的白盒对抗攻击方法，能够在统一框架下为任意像素级预测任务优化专用攻击。该方法利用预测分布与真实标签（或目标）分布之间的余弦相似度，将攻击从分类任务直接扩展到回归任务。在PASCAL VOC2012和CityScapes数据集上的实验中，我们在语义分割攻击方面超越了现有技术水平。此外，我们为光流估计和图像恢复任务设立了对抗攻击的新基准，展示了该方法向任意像素级预测任务扩展的能力。

相关内容

白盒

关注 0

白盒测试（也称为透明盒测试，玻璃盒测试，透明盒测试和结构测试）是一种软件测试方法，用于测试应用程序的内部结构或功能，而不是其功能（即黑盒测试）。在白盒测试中，系统的内部视角以及编程技能被用来设计测试用例。测试人员选择输入以遍历代码的路径并确定预期的输出。这类似于测试电路中的节点，在线测试（ICT）。白盒测试可以应用于软件测试过程的单元，集成和系统级别。尽管传统的测试人员倾向于将白盒测试视为在单元级别进行的，但如今它已越来越频繁地用于集成和系统测试。它可以测试单元内的路径，集成期间单元之间的路径以及系统级测试期间子系统之间的路径。

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【WSDM2020】超越统计关系：将知识关系整合到多标签音乐风格分类的风格关联中（附pdf）

专知会员服务

18+阅读 · 2019年11月23日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日