Despite being trained on balanced datasets, existing AI-generated image detectors often exhibit systematic bias at test time, frequently misclassifying fake images as real. We hypothesize that this behavior stems from distributional shift in fake samples and implicit priors learned during training. Specifically, models tend to overfit to superficial artifacts that do not generalize well across different generation methods, leading to a misaligned decision threshold when faced with test-time distribution shift. To address this, we propose a theoretically grounded post-hoc calibration framework based on Bayesian decision theory. In particular, we introduce a learnable scalar correction to the model's logits, optimized on a small validation set from the target distribution while keeping the backbone frozen. This parametric adjustment compensates for distributional shift in model output, realigning the decision boundary even without requiring ground-truth labels. Experiments on challenging benchmarks show that our approach significantly improves robustness without retraining, offering a lightweight and principled solution for reliable and adaptive AI-generated image detection in the open world. Code is available at https://github.com/muliyangm/AIGI-Det-Calib.
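The core idea of the calibration step can be sketched in a few lines. The snippet below is a minimal illustration, not the authors' implementation: it assumes a labeled validation set (one supervised variant of the calibration the abstract describes), and learns a single additive scalar `delta` on the frozen detector's logits by gradient descent on binary cross-entropy. The detector logits, labels, and hyperparameters are all hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def calibrate_bias(logits, labels, lr=0.1, steps=500):
    """Learn a scalar additive correction `delta` on frozen logits.

    Minimizes binary cross-entropy on a small validation set; the
    backbone is untouched -- only the decision boundary shifts.
    """
    delta = 0.0
    n = len(logits)
    for _ in range(steps):
        # dBCE/ddelta = mean over samples of (sigmoid(z + delta) - y)
        grad = sum(sigmoid(z + delta) - y for z, y in zip(logits, labels)) / n
        delta -= lr * grad
    return delta


# Hypothetical frozen-detector logits on a target-domain validation set,
# biased toward "real" (negative): several fakes (y=1) score below zero.
val_logits = [-2.0, -1.5, -0.5, 0.3, -1.0, 0.8]
val_labels = [0, 0, 1, 1, 1, 0]  # 1 = fake, 0 = real

delta = calibrate_bias(val_logits, val_labels)
corrected = [z + delta for z in val_logits]
```

Because the validation fakes here receive systematically low logits, the learned `delta` is positive, shifting the effective threshold so that more borderline fakes cross the decision boundary; at test time the frozen detector is used as-is with `delta` added to its output.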