Efficient Defense Against Model Stealing Attacks on Convolutional Neural Networks

from arxiv, Accepted for publication at 2023 International Conference on Machine Learning and Applications (ICMLA). Proceedings of ICMLA, Florida, USA \c{opyright}2023 IEEE

Model stealing attacks have become a serious concern for deep learning models, where an attacker can steal a trained model by querying its black-box API. This can lead to intellectual property theft and other security and privacy risks. The current state-of-the-art defenses against model stealing attacks suggest adding perturbations to the prediction probabilities. However, they suffer from heavy computations and make impracticable assumptions about the adversary. They often require the training of auxiliary models. This can be time-consuming and resource-intensive which hinders the deployment of these defenses in real-world applications. In this paper, we propose a simple yet effective and efficient defense alternative. We introduce a heuristic approach to perturb the output probabilities. The proposed defense can be easily integrated into models without additional training. We show that our defense is effective in defending against three state-of-the-art stealing attacks. We evaluate our approach on large and quantized (i.e., compressed) Convolutional Neural Networks (CNNs) trained on several vision datasets. Our technique outperforms the state-of-the-art defenses with a $\times37$ faster inference latency without requiring any additional model and with a low impact on the model's performance. We validate that our defense is also effective for quantized CNNs targeting edge devices.

翻译：模型窃取攻击已成为深度学习模型面临的严重威胁，攻击者可通过查询黑盒API窃取训练好的模型，进而导致知识产权盗窃及其他安全与隐私风险。当前最先进的模型窃取防御方法建议在预测概率中添加扰动，但这些方法存在计算开销大、对攻击者的假设不切实际等问题，通常需要训练辅助模型。这种耗时且消耗资源的过程阻碍了防御措施在真实场景中的部署。本文提出一种简单高效且有效的替代防御方案，通过启发式方法扰动输出概率。该防御方法无需额外训练即可轻松集成至现有模型。实验表明，该方法能有效防御三种最先进的窃取攻击。我们在多个视觉数据集训练的大规模及量化（即压缩）卷积神经网络（CNN）上评估了本方案。与现有最先进防御方法相比，本技术在不需额外模型且对模型性能影响较小的前提下，推理延迟降低了37倍。此外，我们验证了该防御方法对面向边缘设备的量化CNN同样有效。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日