Rethinking White-Box Watermarks on Deep Learning Models under Neural Structural Obfuscation

Copyright protection for deep neural networks (DNNs) is an urgent need for AI corporations. To trace illegally distributed model copies, DNN watermarking has emerged as a technique for embedding and verifying secret identity messages in either a model's prediction behavior or its internals. The latter branch, called white-box DNN watermarking, sacrifices less functionality and draws on more knowledge of the target DNN; it is believed to be accurate, credible, and secure against most known watermark removal attacks, and it is attracting research effort in both academia and industry. In this paper, we present the first systematic study of how mainstream white-box DNN watermarks are commonly vulnerable to neural structural obfuscation with dummy neurons, a group of neurons that can be added to a target model while leaving its behavior invariant. We devise a comprehensive framework that automatically generates and injects dummy neurons with high stealthiness; the resulting attack extensively modifies the architecture of the target model to inhibit watermark verification. Through extensive evaluation, our work shows for the first time that nine published watermarking schemes require amendments to their verification procedures.
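To make the idea of neurons that change a model's structure but not its behavior concrete, here is a minimal sketch on a toy two-layer ReLU network written with plain Python lists. The abstract does not describe how the paper generates dummy neurons, so the two constructions below (a neuron with zero outgoing weights, and a duplicated pair whose outgoing weights cancel) are illustrative assumptions, and the names `forward`, `add_zero_dummy`, and `add_cancelling_pair` are hypothetical.

```python
import math
import random


def forward(x, W1, b1, W2, b2):
    """Toy two-layer MLP: y = W2 * relu(W1 * x + b1) + b2, on plain lists."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(W1, b1)]
    return [sum(w * h for w, h in zip(row, hidden)) + b
            for row, b in zip(W2, b2)]


def add_zero_dummy(W1, b1, W2, rng):
    """Assumed construction 1: a new hidden neuron with arbitrary incoming
    weights whose outgoing weights are all zero, so it never affects y."""
    n_in = len(W1[0])
    new_W1 = [row[:] for row in W1] + [[rng.uniform(-1, 1) for _ in range(n_in)]]
    new_b1 = b1 + [rng.uniform(-1, 1)]
    new_W2 = [row + [0.0] for row in W2]
    return new_W1, new_b1, new_W2


def add_cancelling_pair(W1, b1, W2, rng):
    """Assumed construction 2: two identical hidden neurons whose outgoing
    weights are +c and -c, so their contributions cancel in every output."""
    n_in = len(W1[0])
    incoming = [rng.uniform(-1, 1) for _ in range(n_in)]
    bias = rng.uniform(-1, 1)
    new_W1 = [row[:] for row in W1] + [incoming[:], incoming[:]]
    new_b1 = b1 + [bias, bias]
    new_W2 = []
    for row in W2:
        c = rng.uniform(-1, 1)
        new_W2.append(row + [c, -c])
    return new_W1, new_b1, new_W2


# Build a small random model: 3 inputs, 4 hidden neurons, 2 outputs.
rng = random.Random(0)
W1 = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(4)]
b1 = [rng.uniform(-1, 1) for _ in range(4)]
W2 = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(2)]
b2 = [rng.uniform(-1, 1) for _ in range(2)]
x = [0.5, -1.2, 2.0]

y_ref = forward(x, W1, b1, W2, b2)

# Inject three dummy neurons in total: one zero-output neuron, then a pair.
W1z, b1z, W2z = add_zero_dummy(W1, b1, W2, rng)
W1p, b1p, W2p = add_cancelling_pair(W1z, b1z, W2z, rng)
y_obf = forward(x, W1p, b1p, W2p, b2)

# The hidden layer grew from 4 to 7 neurons, yet the outputs agree
# up to floating-point rounding.
same = all(math.isclose(a, b, abs_tol=1e-9) for a, b in zip(y_ref, y_obf))
print(len(W1), "->", len(W1p), "hidden neurons; outputs unchanged:", same)
```

In this toy setting, any procedure that reads weights at fixed positions or expects the original layer shapes would now see a 7-row first layer instead of a 4-row one, which hints at why verification procedures tied to the original structure could be disrupted. Making such neurons hard to detect, the stealthiness the abstract emphasizes, is not addressed by this sketch.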

