Generative image modeling enables a wide range of applications but raises ethical concerns about responsible deployment. This paper introduces an active strategy combining image watermarking and Latent Diffusion Models. The goal is for all generated images to conceal an invisible watermark allowing for future detection and/or identification. The method quickly fine-tunes the latent decoder of the image generator, conditioned on a binary signature. A pre-trained watermark extractor recovers the hidden signature from any generated image and a statistical test then determines whether it comes from the generative model. We evaluate the invisibility and robustness of the watermarks on a variety of generation tasks, showing that Stable Signature works even after the images are modified. For instance, it detects the origin of an image generated from a text prompt, then cropped to keep $10\%$ of the content, with $90$+$\%$ accuracy at a false positive rate below 10$^{-6}$.
翻译:生成式图像建模支持广泛的应用,但引发了关于负责任部署的伦理关切。本文提出了一种结合图像水印与潜在扩散模型的主动策略,目标是使所有生成的图像都隐藏不可见的水印,以便未来进行检测和/或识别。该方法基于二进制签名条件,对图像生成器的潜在解码器进行快速微调。预训练的水印提取器从任何生成的图像中恢复隐藏的签名,并通过统计检验确定该图像是否来自生成模型。我们在多种生成任务上评估了水印的不可见性和鲁棒性,结果表明即使经过图像修改,稳定签名仍能有效工作。例如,对于基于文本提示生成的图像,即使裁剪至仅保留10%的内容,该方法仍能以低于10^{-6}的误报率实现90%以上的检测准确率。