Deep neural networks (DNNs) deployed in the cloud often allow users to query the models via APIs. However, these APIs expose the models to model extraction attacks (MEAs), in which an attacker attempts to duplicate the target model by abusing the API's responses. Backdoor-based DNN watermarking is a promising defense against MEAs: the defender injects a backdoor into extracted models via the API responses, and the backdoor serves as a watermark of the model; if a suspicious model contains the watermark (i.e., the backdoor), it is verified as an extracted model. This work focuses on object detection (OD) models. Existing backdoor attacks on OD models are not applicable to model watermarking as a defense against MEAs under a realistic threat model. Our proposed approach injects a backdoor into extracted models via the API by stealthily modifying the bounding boxes (BBs) of objects detected in queries while preserving the OD capability. In experiments on three OD datasets, the proposed approach identified the extracted models with 100% accuracy across a wide variety of experimental scenarios.
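To make the idea concrete, the following is a minimal, hypothetical sketch of how an OD API could stealthily perturb returned bounding boxes for a secret subset of queries so that the perturbation is absorbed by an extracted model as a watermark. All names and parameters (is_trigger_query, WATERMARK_SHIFT, SECRET_KEY) are illustrative assumptions, not the paper's actual algorithm.

```python
# Hypothetical illustration: watermarking OD API responses by slightly
# shifting bounding boxes on a secret, deterministically chosen subset
# of queries. This is a sketch of the general idea, not the proposed method.
import hashlib
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)

SECRET_KEY = b"defender-secret"   # assumed defender-held key
WATERMARK_SHIFT = 2.0             # small pixel shift, assumed to keep OD utility


def is_trigger_query(image_bytes: bytes) -> bool:
    """Deterministically mark a small, secret fraction of queries as triggers."""
    digest = hashlib.sha256(SECRET_KEY + image_bytes).digest()
    return digest[0] < 8  # roughly 3% of queries carry the watermark


def watermark_response(image_bytes: bytes, boxes: List[Box]) -> List[Box]:
    """Return the API's detections; stealthily shift BBs for trigger queries."""
    if not is_trigger_query(image_bytes):
        return boxes
    return [
        (x1 + WATERMARK_SHIFT, y1 + WATERMARK_SHIFT,
         x2 + WATERMARK_SHIFT, y2 + WATERMARK_SHIFT)
        for (x1, y1, x2, y2) in boxes
    ]
```

An attacker who trains a surrogate model on these responses also learns the shifted BBs on trigger queries; at verification time, the defender can replay trigger queries against a suspicious model and check whether its predictions reproduce the characteristic shift.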