Many machine learning models are susceptible to adversarial attacks, and decision-based black-box attacks pose the most critical threat in real-world applications. These attacks are extremely stealthy, generating adversarial examples using only the hard labels returned by the target machine learning model. This is typically realized by optimizing perturbation directions, guided by decision boundaries identified through query-intensive exact search, which significantly limits the attack success rate. This paper introduces a novel approach that uses an Approximation Decision Boundary (ADB) to efficiently and accurately compare perturbation directions without precisely determining the decision boundaries. The effectiveness of our ADB approach (ADBA) hinges on promptly identifying a suitable ADB that reliably differentiates all perturbation directions. To this end, we analyze the probability distribution of decision boundaries and confirm that using the distribution's median value as the ADB effectively distinguishes different perturbation directions, giving rise to the ADBA-md algorithm. ADBA-md requires only four queries on average to differentiate any pair of perturbation directions, making it highly query-efficient. Extensive experiments on six well-known image classifiers clearly demonstrate the superiority of ADBA and ADBA-md over multiple state-of-the-art black-box attacks. The source code is available at https://github.com/BUPTAIOC/ADBA.
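To illustrate the core idea only (this is a minimal sketch, not the paper's implementation), the snippet below compares two perturbation directions with a handful of hard-label queries around a shared approximation boundary radius, rather than binary-searching each direction's exact boundary. All names here (`compare_directions`, `adb`, `model_label`) are hypothetical illustrations.

```python
import numpy as np

def is_adversarial(model_label, x, direction, r, true_label):
    """One hard-label query: is x perturbed by radius r along `direction` misclassified?"""
    return model_label(x + r * direction) != true_label

def compare_directions(model_label, x, true_label, d1, d2, adb, max_rounds=5):
    """Return the direction with the smaller estimated boundary radius.

    Instead of locating each direction's exact decision boundary, both
    directions are probed at a shared radius r (initialized to the
    approximation decision boundary `adb`). As soon as one direction is
    adversarial at r and the other is not, the comparison is decided;
    otherwise r is refined bisection-style and both are probed again.
    """
    r = adb
    lo, hi = 0.0, np.inf  # bracket on the smaller boundary radius
    for _ in range(max_rounds):
        a1 = is_adversarial(model_label, x, d1, r, true_label)
        a2 = is_adversarial(model_label, x, d2, r, true_label)
        if a1 != a2:
            # The direction already adversarial at r has the nearer boundary.
            return d1 if a1 else d2
        if a1 and a2:        # both succeed: the smaller boundary is below r
            hi = r
            r = (lo + hi) / 2
        else:                # both fail: the smaller boundary is above r
            lo = r
            r = 2 * r if hi == np.inf else (lo + hi) / 2
    return d1  # undecided within the query budget: keep the incumbent
```

A toy usage: against a linear hard-label classifier whose boundary lies 0.5 away along the first axis, comparing that axis direction with an orthogonal (never-adversarial) one picks the first direction after a few queries, whether the initial radius starts above or below the true boundary.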