Beyond Binary Classification: A Semi-supervised Approach to Generalized AI-generated Image Detection

The rapid advancement of generators (e.g., StyleGAN, Midjourney, DALL-E) has produced highly realistic synthetic images, posing significant challenges to digital media authenticity. These generators are typically based on a few core architectural families, primarily Generative Adversarial Networks (GANs) and Diffusion Models (DMs). A critical vulnerability in current forensics is the failure of detectors to achieve cross-generator generalization, especially when crossing architectural boundaries (e.g., from GANs to DMs). We hypothesize that this gap stems from fundamental differences in the artifacts produced by these \textbf{distinct architectures}. In this work, we provide a theoretical analysis explaining how the distinct optimization objectives of the GAN and DM architectures lead to different manifold coverage behaviors. We demonstrate that GANs permit partial coverage, often leading to boundary artifacts, while DMs enforce complete coverage, resulting in over-smoothing patterns. Motivated by this analysis, we propose the \textbf{Tri}archy \textbf{Detect}or (TriDetect), a semi-supervised approach that enhances binary classification by discovering latent architectural patterns within the "fake" class. TriDetect employs balanced cluster assignment via the Sinkhorn-Knopp algorithm and a cross-view consistency mechanism, encouraging the model to learn fundamental architectural distincts. We evaluate our approach on two standard benchmarks and three in-the-wild datasets against 13 baselines to demonstrate its generalization capability to unseen generators.

翻译：生成器（如StyleGAN、Midjourney、DALL-E）的快速发展已能产生高度逼真的合成图像，对数字媒体真实性构成重大挑战。这些生成器通常基于少数核心架构家族，主要是生成对抗网络（GANs）和扩散模型（DMs）。当前取证技术的一个关键缺陷是检测器无法实现跨生成器的泛化，尤其是在跨越架构边界时（例如从GANs到DMs）。我们假设这一差距源于这些不同架构产生的伪影存在根本性差异。在本研究中，我们通过理论分析解释了GAN和DM架构的不同优化目标如何导致不同的流形覆盖行为。我们证明GANs允许部分覆盖，常导致边界伪影，而DMs强制完全覆盖，产生过度平滑模式。基于此分析，我们提出Triarchy检测器（TriDetect），这是一种半监督方法，通过发现“虚假”类别内的潜在架构模式来增强二元分类。TriDetect采用Sinkhorn-Knopp算法实现平衡聚类分配，并引入跨视图一致性机制，促使模型学习基本的架构差异。我们在两个标准基准和三个真实场景数据集上评估了该方法，对比了13个基线模型，以证明其对未见生成器的泛化能力。

相关内容

生成器

关注 2

生成器是一次生成一个值的特殊类型函数。可以将其视为可恢复函数。调用该函数将返回一个可用于生成连续 x 值的生成【Generator】，简单的说就是在函数的执行过程中，yield语句会把你需要的值返回给调用生成器的地方，然后退出函数，下一次调用生成器函数的时候又从上次中断的地方开始执行，而生成器内的所有变量参数都会被保存下来供下一次使用。

【ICML2025】QuRe：通过困难负样本采样实现查询相关的组合图像检索

专知会员服务

7+阅读 · 2025年7月20日

[ICCV2025]EAMamba：面向图像恢复的高效全能视觉状态空间模型

专知会员服务

5+阅读 · 2025年7月1日

【ICLR2025】为多模态图像-文本表示可解释性缩小信息瓶颈理论

专知会员服务

15+阅读 · 2025年2月24日

【AAAI2025】TimeDP：通过领域提示学习生成多领域时间序列

专知会员服务

14+阅读 · 2025年1月10日