Getting a-Round Guarantees: Floating-Point Attacks on Certified Robustness - 专知论文

会员服务 ·

0

线性的 · 稳健性 · MNIST (数据集) · Networking · 样例 ·

2023 年 2 月 2 日

Getting a-Round Guarantees: Floating-Point Attacks on Certified Robustness

翻译：关于圆的保证：针对认证鲁棒性的浮点数攻击

Jiankai Jin,Olga Ohrimenko,Benjamin I. P. Rubinstein

Adversarial examples pose a security risk as they can alter decisions of a machine learning classifier through slight input perturbations. Certified robustness has been proposed as a mitigation where given an input $x$, a classifier returns a prediction and a radius with a provable guarantee that any perturbation to $x$ within this radius (e.g., under the $L_2$ norm) will not alter the classifier's prediction. In this work, we show that these guarantees can be invalidated due to limitations of floating-point representation that cause rounding errors. We design a rounding search method that can efficiently exploit this vulnerability to find adversarial examples within the certified radius. We show that the attack can be carried out against several linear classifiers that have exact certifiable guarantees and against neural networks with ReLU activations that have conservative certifiable guarantees. Our experiments demonstrate attack success rates over 50% on random linear classifiers, up to 23.24% on the MNIST dataset for linear SVM, and up to 15.83% on the MNIST dataset for a neural network whose certified radius was given by a verifier based on mixed integer programming. Finally, as a mitigation, we advocate the use of rounded interval arithmetic to account for rounding errors.

翻译：对抗样本通过微小的输入扰动就能改变机器学习分类器的决策，构成安全风险。认证鲁棒性被提出作为缓解措施：给定输入$x$，分类器返回预测结果和半径，并提供可验证的保证，即该半径内对$x$的任何扰动（例如在$L_2$范数下）不会改变分类器的预测。本研究表明，由于浮点数表示的限制导致的舍入误差，这些保证可能失效。我们设计了一种舍入搜索方法，能够高效利用这一漏洞，在认证半径内找到对抗样本。实验证明，该攻击可成功实施于具有精确认证保证的线性分类器，以及具有保守认证保证的ReLU激活神经网络。在随机线性分类器上攻击成功率超过50%，在MNIST数据集上对线性SVM达到23.24%，对基于混合整数规划验证器给定认证半径的神经网络达到15.83%。最后，作为缓解措施，我们主张使用舍入区间算术来考虑舍入误差的影响。

0

相关内容

线性的

不可错过！CMU《机器学习导论》2023课程，Matt Gormley带队讲授，附Slides

不可错过！CMU《机器学习导论》2023课程，Matt Gormley带队讲授，附Slides

专知会员服务

38+阅读 · 2023年2月7日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

46+阅读 · 2020年10月31日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

116+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

【NeurIPS2019】基于累加噪声的对抗鲁棒性（Certified Adversarial Robustness with Additive Noise），Changyou Chen

【NeurIPS2019】基于累加噪声的对抗鲁棒性（Certified Adversarial Robustness with Additive Noise），Changyou Chen

专知会员服务

36+阅读 · 2019年11月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

GPC3调控肝细胞癌侵袭和转移的信号通路研究

国家自然科学基金

0+阅读 · 2014年12月31日

电力设备tanδ在线监测中的信号去噪

国家自然科学基金

0+阅读 · 2013年12月31日

流体压缩性对微机电系统封装喷射泵点胶特性的影响机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

光纤激光传感阵列中的相干坍塌及抑制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

高维数据的假设检验

国家自然科学基金

0+阅读 · 2012年12月31日

雷帕霉素通过Foxo3调节DCs耐受性抑制移植后肿瘤生长的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

新因子hARAP3在AR介导基因转录调控及前列腺癌中的作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

Dyrk1A调控CaMKⅡ#948;的可变剪接及其在心脏重构过程中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

高亮度、窄线宽激光二极管阵列

国家自然科学基金

0+阅读 · 2009年12月31日

p进表示的伽罗瓦上同调

国家自然科学基金

0+阅读 · 2008年12月31日

How many dimensions are required to find an adversarial example?

Arxiv

0+阅读 · 2023年3月24日

Feature Separation and Recalibration for Adversarial Robustness

Arxiv

0+阅读 · 2023年3月24日

A Closer Look at Model Adaptation using Feature Distortion and Simplicity Bias

A Closer Look at Model Adaptation using Feature Distortion and Simplicity Bias

Arxiv

0+阅读 · 2023年3月23日

LearnedFTL: A Learning-based Page-level FTL for Improving Random Reads in Flash-based SSDs

Arxiv

0+阅读 · 2023年3月23日

Functional-Coefficient Quantile Regression for Panel Data with Latent Group Structure

Arxiv

0+阅读 · 2023年3月23日

Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval

Arxiv

0+阅读 · 2023年3月22日

Membership Inference Attacks against Diffusion Models

Arxiv

0+阅读 · 2023年3月22日

Adversarial Robustness of Representation Learning for Knowledge Graphs

Arxiv

10+阅读 · 2022年9月30日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

Feature Denoising for Improving Adversarial Robustness

Feature Denoising for Improving Adversarial Robustness

Arxiv

15+阅读 · 2018年12月9日

VIP会员

文章信息

相关主题

MNIST (数据集)

最新内容

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

专知会员服务

1+阅读 · 今天14:45

综述 | 世界动作模型：少做梦，多行动

综述 | 世界动作模型：少做梦，多行动

专知会员服务

1+阅读 · 今天14:43

美以伊冲突：无人机与人工智能的运用

美以伊冲突：无人机与人工智能的运用

专知会员服务

3+阅读 · 今天14:31

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

专知会员服务

3+阅读 · 今天14:20

《特种部队在透明战场中的生存力》最新报告

《特种部队在透明战场中的生存力》最新报告

专知会员服务

2+阅读 · 今天14:11

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

专知会员服务

3+阅读 · 今天14:07

《人工智能生成的零日漏洞：对未来作战的影响》

《人工智能生成的零日漏洞：对未来作战的影响》

专知会员服务

3+阅读 · 今天14:03

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

专知会员服务

2+阅读 · 今天13:59

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

5+阅读 · 6月22日

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

8+阅读 · 6月22日

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

7+阅读 · 6月22日

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

4+阅读 · 6月22日

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

5+阅读 · 6月22日

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

5+阅读 · 6月22日

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

8+阅读 · 6月22日

相关VIP内容

不可错过！CMU《机器学习导论》2023课程，Matt Gormley带队讲授，附Slides

不可错过！CMU《机器学习导论》2023课程，Matt Gormley带队讲授，附Slides

专知会员服务

38+阅读 · 2023年2月7日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

46+阅读 · 2020年10月31日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

116+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

【NeurIPS2019】基于累加噪声的对抗鲁棒性（Certified Adversarial Robustness with Additive Noise），Changyou Chen

【NeurIPS2019】基于累加噪声的对抗鲁棒性（Certified Adversarial Robustness with Additive Noise），Changyou Chen

专知会员服务

36+阅读 · 2019年11月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 世界动作模型：少做梦，多行动

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

美以伊冲突：无人机与人工智能的运用

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

How many dimensions are required to find an adversarial example?

Arxiv

0+阅读 · 2023年3月24日

Feature Separation and Recalibration for Adversarial Robustness

Arxiv

0+阅读 · 2023年3月24日

A Closer Look at Model Adaptation using Feature Distortion and Simplicity Bias

A Closer Look at Model Adaptation using Feature Distortion and Simplicity Bias

Arxiv

0+阅读 · 2023年3月23日

LearnedFTL: A Learning-based Page-level FTL for Improving Random Reads in Flash-based SSDs

Arxiv

0+阅读 · 2023年3月23日

Functional-Coefficient Quantile Regression for Panel Data with Latent Group Structure

Arxiv

0+阅读 · 2023年3月23日

Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval

Arxiv

0+阅读 · 2023年3月22日

Membership Inference Attacks against Diffusion Models

Arxiv

0+阅读 · 2023年3月22日

Adversarial Robustness of Representation Learning for Knowledge Graphs

Arxiv

10+阅读 · 2022年9月30日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

Feature Denoising for Improving Adversarial Robustness

Feature Denoising for Improving Adversarial Robustness

Arxiv

15+阅读 · 2018年12月9日

相关基金

GPC3调控肝细胞癌侵袭和转移的信号通路研究

国家自然科学基金

0+阅读 · 2014年12月31日

电力设备tanδ在线监测中的信号去噪

国家自然科学基金

0+阅读 · 2013年12月31日

流体压缩性对微机电系统封装喷射泵点胶特性的影响机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

光纤激光传感阵列中的相干坍塌及抑制方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

高维数据的假设检验

国家自然科学基金

0+阅读 · 2012年12月31日

雷帕霉素通过Foxo3调节DCs耐受性抑制移植后肿瘤生长的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

新因子hARAP3在AR介导基因转录调控及前列腺癌中的作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

Dyrk1A调控CaMKⅡ#948;的可变剪接及其在心脏重构过程中的作用

国家自然科学基金

0+阅读 · 2009年12月31日

高亮度、窄线宽激光二极管阵列

国家自然科学基金

0+阅读 · 2009年12月31日

p进表示的伽罗瓦上同调

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员