RNN-Guard: Certified Robustness Against Multi-frame Attacks for Recurrent Neural Networks

RNN · Attack · Robustness · Perturbation · Recurrent Neural Networks

April 17, 2023

Yunruo Zhang, Tianyu Du, Shouling Ji, Peng Tang, Shanqing Guo

From arXiv; 13 pages, 7 figures, 6 tables

It is well known that recurrent neural networks (RNNs), although widely used, are vulnerable to adversarial attacks, including one-frame attacks and multi-frame attacks. Though a few certified defenses exist to provide guaranteed robustness against one-frame attacks, we prove that defending against multi-frame attacks remains a challenging problem due to their enormous perturbation space. In this paper, we propose the first certified defense against multi-frame attacks for RNNs, called RNN-Guard. To address the above challenge, we adopt the perturb-all-frame strategy to construct perturbation spaces consistent with those in multi-frame attacks. However, the perturb-all-frame strategy causes a precision issue in linear relaxations. To address this issue, we introduce a novel abstract domain called InterZono and design tighter relaxations. We prove that InterZono is more precise than Zonotope yet carries the same time complexity. Experimental evaluations across various datasets and model structures show that the certified robust accuracy calculated by RNN-Guard with InterZono is up to 2.18 times higher than that with Zonotope. In addition, we extend RNN-Guard as the first certified training method against multi-frame attacks to directly enhance RNNs' robustness. The results show that the certified robust accuracy of models trained with RNN-Guard against multi-frame attacks is 15.47 to 67.65 percentage points higher than that of models trained with other methods.
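To make the perturb-all-frame idea concrete, the following is a minimal sketch of certification in which every input frame is perturbed within an L-infinity ball of radius `eps`. It is not the paper's method: it uses the plain interval (box) abstract domain, which is coarser than both Zonotope and InterZono, and it assumes a small Elman-style RNN with `tanh` activations and no biases. All names (`affine_bounds`, `certify_all_frames`, `W_x`, `W_h`, `W_out`) are hypothetical and chosen only for illustration.

```python
import math
from typing import List, Tuple

Vector = List[float]
Matrix = List[List[float]]


def affine_bounds(W: Matrix, lo: Vector, hi: Vector) -> Tuple[Vector, Vector]:
    """Sound interval bounds on W @ x for every x with lo <= x <= hi."""
    out_lo, out_hi = [], []
    for row in W:
        # A non-negative weight is minimized at lo, a negative weight at hi.
        out_lo.append(sum(w * (l if w >= 0 else h) for w, l, h in zip(row, lo, hi)))
        out_hi.append(sum(w * (h if w >= 0 else l) for w, l, h in zip(row, lo, hi)))
    return out_lo, out_hi


def certify_all_frames(frames: List[Vector], eps: float, W_x: Matrix,
                       W_h: Matrix, W_out: Matrix, label: int) -> bool:
    """Return True if the box analysis proves the prediction stays `label`
    when every frame is perturbed by at most eps (assumed toy RNN)."""
    h_lo = [0.0] * len(W_h)  # initial hidden state is exactly zero
    h_hi = [0.0] * len(W_h)
    for x in frames:
        # Perturb-all-frame: each frame gets its own eps-box.
        x_lo = [v - eps for v in x]
        x_hi = [v + eps for v in x]
        a_lo, a_hi = affine_bounds(W_x, x_lo, x_hi)
        b_lo, b_hi = affine_bounds(W_h, h_lo, h_hi)
        # tanh is monotone, so it maps interval endpoints to endpoints.
        h_lo = [math.tanh(a + b) for a, b in zip(a_lo, b_lo)]
        h_hi = [math.tanh(a + b) for a, b in zip(a_hi, b_hi)]
    for j, row_j in enumerate(W_out):
        if j == label:
            continue
        # Lower-bound the margin logit[label] - logit[j] over the final box.
        diff = [[wl - wj for wl, wj in zip(W_out[label], row_j)]]
        margin_lo, _ = affine_bounds(diff, h_lo, h_hi)
        if margin_lo[0] <= 0:
            return False  # cannot rule out a misclassification
    return True
```

Under this sketch, certified robust accuracy would be the fraction of test sequences that are classified correctly and for which `certify_all_frames` returns `True`. A `False` result means only that the box analysis failed to prove robustness; a more precise domain may still certify the same input.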
