With the rapid advancement of generative AI, virtual try-on (VTON) systems are becoming increasingly common in e-commerce and digital entertainment. However, the growing realism of AI-generated try-on content raises pressing concerns about authenticity and responsible use. To address this, we present VTONGuard, a large-scale benchmark dataset containing over 775,000 real and synthetic try-on images. The dataset covers diverse real-world conditions, including variations in pose, background, and garment styles, and provides both authentic and manipulated examples. Based on this benchmark, we conduct a systematic evaluation of multiple detection paradigms under unified training and testing protocols. Our results reveal each method's strengths and weaknesses and highlight the persistent challenge of cross-paradigm generalization. To further advance detection, we design a multi-task framework that integrates auxiliary segmentation to enhance boundary-aware feature learning, achieving the best overall performance on VTONGuard. We expect this benchmark to enable fair comparisons, facilitate the development of more robust detection models, and promote the safe and responsible deployment of VTON technologies in practice.