Backdoor watermarking has emerged as the predominant approach for protecting public datasets, enabling dataset ownership verification (DOV) through embedded triggers that induce predefined model behaviors. While existing works assume that DOV results can serve as reliable evidence for copyright infringement claims, we argue that this assumption is fundamentally flawed. In this paper, we expose critical vulnerabilities in current backdoor watermarking schemes by demonstrating that attackers can forge watermarks that are statistically indistinguishable from the original ones, thereby evading infringement allegations. Specifically, we propose the Forged Watermark Generator (FW-Gen), a lightweight variational autoencoder-based framework that generates forged watermarks which preserve the statistical properties of the original watermarks while exhibiting distinct visual patterns. Our attack operates under a realistic threat model: an accused attacker, upon receiving an infringement claim, extracts watermark information from the protected dataset and produces counterfeit evidence to refute the allegation. Extensive experiments across six backdoor watermarking methods, two benchmark datasets, and two model architectures demonstrate that forged watermarks achieve statistical significance in hypothesis testing equal to or greater than that of the original watermarks. These findings reveal that current DOV mechanisms are insufficient as standalone evidence in copyright disputes and call for more robust dataset protection schemes.