Recent studies show that deployed deep learning (DL) models, such as TensorFlow Lite (TFLite) models, can easily be extracted from real-world applications and devices by attackers and used to mount many kinds of attacks, such as adversarial attacks. Although securing deployed on-device DL models has gained increasing attention, no existing method can fully prevent these threats. Traditional software protection techniques have been widely explored; if on-device models could be implemented in pure code, such as C++, it would open the possibility of reusing existing software protection techniques. However, due to the complexity of DL models, no automatic method exists that can translate DL models into pure code. To fill this gap, we propose a novel method, CustomDLCoder, which automatically extracts on-device model information and synthesizes a customized executable program for a wide range of DL models. CustomDLCoder first parses the DL model, extracts its backend computing units, configures the computing units into a graph, and then generates customized code to implement and deploy the ML solution without an explicit model representation. The synthesized program hides model information in DL deployment environments because it does not retain an explicit model representation, preventing many attacks on the DL model. In addition, it improves ML performance because the customized code removes the model parsing and preprocessing steps and retains only the data computing process. Our experimental results show that CustomDLCoder improves model security by disabling on-device model sniffing. Compared with the original on-device platform (i.e., TFLite), our method accelerates model inference by 21.8% and 24.3% on x86-64 and ARM64 platforms, respectively. Most importantly, it significantly reduces memory consumption, by 68.8% and 36.0% on x86-64 and ARM64 platforms, respectively.
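To make the pipeline concrete, the following is a minimal toy sketch (not the actual CustomDLCoder implementation; all names such as `ComputeUnit` and `generate_code` are hypothetical) of the core idea: once the model's backend computing units have been extracted and ordered into a graph, a standalone source program can be emitted that contains only the data computing process, with the operator parameters baked into the code rather than stored in a parsable model file.

```python
from dataclasses import dataclass, field

# Hypothetical representation of one extracted backend computing unit:
# an operator name plus the configuration recovered from the model file.
@dataclass
class ComputeUnit:
    op: str                       # e.g. "conv2d", "relu"
    params: dict = field(default_factory=dict)

def generate_code(units):
    """Emit a standalone C++-style source string chaining the units.

    The emitted program carries no explicit model representation:
    parsing and preprocessing are gone, and each unit's parameters
    are hard-coded into the generated call sequence.
    """
    lines = [
        "// auto-generated: data computing process only",
        "void run(float* buf) {",
    ]
    for u in units:
        args = ", ".join(f"{k}={v}" for k, v in sorted(u.params.items()))
        lines.append(f"    {u.op}(buf /* {args} */);")
    lines.append("}")
    return "\n".join(lines)

# Toy "parsed model": two computing units in topological order.
units = [ComputeUnit("conv2d", {"stride": 1}), ComputeUnit("relu")]
print(generate_code(units))
```

In the real system the emitted code would call concrete kernel implementations, but the sketch captures the key property: the output is pure code that can be compiled and then protected with conventional software protection techniques.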