Watermark-based Attribution of AI-Generated Content

Several companies have deployed watermark-based detection to identify AI-generated content. However, attribution--the ability to trace back to the user of a generative AI (GenAI) service who created a given AI-generated content--remains largely unexplored despite its growing importance. In this work, we aim to bridge this gap by conducting the first systematic study on watermark-based, user-level attribution of AI-generated content. Our key idea is to assign a unique watermark to each user of the GenAI service and embed this watermark into the AI-generated content created by that user. Attribution is then performed by identifying the user whose watermark best matches the one extracted from the given content. This approach, however, faces a key challenge: How should watermarks be selected for users to maximize attribution performance? To address the challenge, we first theoretically derive lower bounds on detection and attribution performance through rigorous probabilistic analysis for any given set of user watermarks. Then, we select watermarks for users to maximize these lower bounds, thereby optimizing detection and attribution performance. Our theoretical and empirical results show that watermark-based attribution inherits both the accuracy and (non-)robustness properties of the underlying watermark. Specifically, attribution remains highly accurate when the watermarked AI-generated content is either not post-processed or subjected to common post-processing such as JPEG compression, as well as black-box adversarial post-processing with limited query budgets.

翻译：多家公司已部署基于水印的检测技术以识别AI生成内容。然而，溯源——即追踪特定AI生成内容至生成式AI服务使用者的能力——尽管日益重要，目前仍鲜有研究。本工作旨在填补这一空白，首次对基于水印的用户级AI生成内容溯源进行系统性研究。我们的核心思路是为生成式AI服务的每位用户分配唯一水印，并将该水印嵌入相应用户创建的AI生成内容中。通过识别与待检测内容提取水印匹配度最高的用户，即可实现溯源。但该方法面临关键挑战：应如何为用户选择水印以最大化溯源性能？为解决该问题，我们首先通过严格概率分析，针对任意给定用户水印集合，理论推导出检测与溯源性能的下界。随后通过优化水印选择来提升这些下界，从而实现检测与溯源性能的最优化。理论与实证结果表明：基于水印的溯源方法继承了底层水印的准确性与（非）鲁棒性特征。具体而言，当水印化AI生成内容未经后处理、或经受JPEG压缩等常规后处理、以及查询预算受限的黑盒对抗性后处理时，溯源仍能保持较高准确性。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

面向 AI 生成图像的安全与鲁棒水印：全面综述

专知会员服务

14+阅读 · 2025年10月6日

【ICCV2025】AIGI-Holmes：面向可解释性与可泛化性的AI生成图像检测方法 —— 基于多模态大语言模型的研究

专知会员服务

10+阅读 · 2025年7月4日

《内容凭证：加强生成式人工智能时代的多媒体完整性》最新25页报告

专知会员服务

20+阅读 · 2025年3月4日

《人工智能生成式文本检测：数据集和数据生成》最新39页报告

专知会员服务

32+阅读 · 2024年12月18日