Attacking the Spike: On the Transferability and Security of Spiking Neural Networks to Adversarial Examples

Spiking neural networks (SNNs) have attracted much attention for their high energy efficiency and for recent advances in their classification performance. However, unlike traditional deep learning approaches, the analysis and study of the robustness of SNNs to adversarial examples remain relatively underdeveloped. In this work, we focus on advancing the adversarial attack side of SNNs and make three major contributions. First, we show that successful white-box adversarial attacks on SNNs are highly dependent on the underlying surrogate gradient technique, even in the case of adversarially trained SNNs. Second, using the best surrogate gradient technique, we analyze the transferability of adversarial attacks on SNNs and other state-of-the-art architectures like Vision Transformers (ViTs) and Big Transfer Convolutional Neural Networks (CNNs). We demonstrate that the adversarial examples created by non-SNN architectures are not misclassified often by SNNs. Third, due to the lack of an ubiquitous white-box attack that is effective across both the SNN and CNN/ViT domains, we develop a new white-box attack, the Auto Self-Attention Gradient Attack (Auto-SAGA). Our novel attack generates adversarial examples capable of fooling both SNN and non-SNN models simultaneously. Auto-SAGA is as much as $91.1\%$ more effective on SNN/ViT model ensembles and provides a $3\times$ boost in attack effectiveness on adversarially trained SNN ensembles compared to conventional white-box attacks like Auto-PGD. Our experiments and analyses are broad and rigorous covering three datasets (CIFAR-10, CIFAR-100 and ImageNet), five different white-box attacks and nineteen classifier models (seven for each CIFAR dataset and five models for ImageNet).

翻译：脉冲神经网络（SNN）因其高能效和近年来分类性能的进步而备受关注。然而，与传统深度学习方法不同，针对SNN在对抗样本下的鲁棒性分析和研究仍相对滞后。本文聚焦于推进SNN的对抗攻击研究，并做出三项主要贡献。首先，我们发现，即使在经过对抗训练的SNN中，成功的白盒对抗攻击仍高度依赖于底层替代梯度技术。其次，基于最优的替代梯度技术，我们分析了SNN对抗攻击的迁移性，并将其与视觉Transformer（ViT）和大规模迁移卷积神经网络（CNN）等前沿架构进行对比。结果表明，非SNN架构生成的对抗样本很少导致SNN误分类。第三，由于缺乏一种在SNN和CNN/ViT领域均有效的通用白盒攻击，我们提出了一种新型白盒攻击方法——自动自注意力梯度攻击（Auto-SAGA）。该攻击能同时生成欺骗SNN和非SNN模型的对抗样本。在SNN/ViT模型集成上，Auto-SAGA的有效性相比传统白盒攻击（如Auto-PGD）提升高达91.1%，并在经过对抗训练的SNN集成上实现了3倍的攻击效果提升。我们的实验与分析覆盖三个数据集（CIFAR-10、CIFAR-100和ImageNet）、五种白盒攻击方法及十九个分类器模型（每个CIFAR数据集七个，ImageNet五个），具有广泛的严谨性。

相关内容

白盒

关注 0

白盒测试（也称为透明盒测试，玻璃盒测试，透明盒测试和结构测试）是一种软件测试方法，用于测试应用程序的内部结构或功能，而不是其功能（即黑盒测试）。在白盒测试中，系统的内部视角以及编程技能被用来设计测试用例。测试人员选择输入以遍历代码的路径并确定预期的输出。这类似于测试电路中的节点，在线测试（ICT）。白盒测试可以应用于软件测试过程的单元，集成和系统级别。尽管传统的测试人员倾向于将白盒测试视为在单元级别进行的，但如今它已越来越频繁地用于集成和系统测试。它可以测试单元内的路径，集成期间单元之间的路径以及系统级测试期间子系统之间的路径。

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日