Investigating White-Box Attacks for On-Device Models

Many mobile apps now embed deep learning models. These on-device models are vulnerable to attack because they can be easily extracted from the apps that ship them. Existing attacks on on-device models are black-box only, which makes them far less effective and efficient than white-box strategies. The reason is that mobile deep learning frameworks such as TFLite do not support gradient computation, which white-box attack algorithms require. We therefore argue that existing findings may underestimate the harm of on-device attacks. To this end, we conduct a study to answer the following research question: can on-device models be directly attacked via white-box strategies? We first systematically analyze the difficulties of transforming an on-device model into a debuggable version, and propose a Reverse Engineering framework for On-device Models (REOM), which automatically reverses a compiled on-device TFLite model into a debuggable model. Specifically, REOM first transforms the compiled on-device model into the Open Neural Network Exchange (ONNX) format, then removes the non-debuggable parts, and finally converts the result into a debuggable DL model format that attackers can exploit in a white-box setting. Our experimental results show that the approach is effective at automated transformation on a set of 244 TFLite models. Compared with previous attacks that use surrogate models, REOM enables attackers to achieve higher attack success rates with attack perturbations a hundred times smaller. In addition, because the ONNX platform offers many tools for model format conversion, the proposed ONNX-based method can be adapted to other model formats. Our findings emphasize the need for developers to carefully consider their model deployment strategies and to use white-box methods to evaluate the vulnerability of on-device models.
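
To illustrate why gradient access matters for white-box attacks, here is a minimal sketch of a gradient-sign perturbation (FGSM, a standard white-box technique that the abstract does not name) against a toy logistic model. This is not REOM or the authors' attack: the weights, input, and `eps` value are hypothetical, and the gradient is derived by hand only because the toy model is small. For a real network, this gradient is exactly what a framework without gradient support cannot provide.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(w, b, x):
    """Probability of class 1 under a toy logistic model (hypothetical)."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)

def loss_gradient_wrt_input(w, b, x, y):
    """Gradient of binary cross-entropy with respect to the input x.

    For a logistic model this is (p - y) * w. A white-box attacker needs
    this quantity; a black-box attacker can only estimate it via queries.
    """
    p = predict(w, b, x)
    return [(p - y) * wi for wi in w]

def sign(v):
    return (v > 0) - (v < 0)

def fgsm(w, b, x, y, eps):
    """Move each input feature by eps in the direction that raises the loss."""
    grad = loss_gradient_wrt_input(w, b, x, y)
    return [xi + eps * sign(g) for xi, g in zip(x, grad)]

# Hypothetical model parameters and input, chosen only for illustration.
w = [2.0, -3.0, 1.0]
b = 0.5
x = [0.2, 0.1, 0.4]
y = 1  # true label

eps = 0.3  # assumed perturbation budget (L-infinity)
x_adv = fgsm(w, b, x, y, eps)

p_clean = predict(w, b, x)
p_adv = predict(w, b, x_adv)
print("clean prediction:", p_clean)
print("adversarial prediction:", p_adv)
```

With these assumed values the clean input is classified as class 1 and the perturbed input as class 0, even though no feature moves by more than `eps`.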

