FAID: Fine-Grained AI-Generated Text Detection Using Multi-Task Auxiliary and Multi-Level Contrastive Learning

The growing collaboration between humans and AI models in generative tasks has introduced new challenges in distinguishing between human-written, LLM-generated, and human-LLM collaborative texts. In this work, we collect a multilingual, multi-domain, multi-generator dataset FAIDSet. We further introduce a fine-grained detection framework FAID to classify text into these three categories, and also to identify the underlying LLM family of the generator. Unlike existing binary classifiers, FAID is built to capture both authorship and model-specific characteristics. Our method combines multi-level contrastive learning with multi-task auxiliary classification to learn subtle stylistic cues. By modeling LLM families as distinct stylistic entities, we incorporate an adaptation to address distributional shifts without retraining for unseen data. Our experimental results demonstrate that FAID outperforms several baselines, particularly enhancing the generalization accuracy on unseen domains and new LLMs, thus offering a potential solution for improving transparency and accountability in AI-assisted writing. Our data and code are available at https://github.com/mbzuai-nlp/FAID

翻译：随着人类与AI模型在生成任务中的协作日益增多，区分人类撰写、大语言模型生成以及人机协作文本带来了新的挑战。本研究收集了一个多语言、多领域、多生成器的数据集FAIDSet。我们进一步提出了一个细粒度检测框架FAID，旨在将文本分类为上述三类，并识别生成文本的底层大语言模型家族。与现有的二分类器不同，FAID旨在捕捉作者身份和模型特异性特征。我们的方法结合了多层次对比学习与多任务辅助分类，以学习细微的风格线索。通过将不同的大语言模型家族建模为独特的风格实体，我们引入了一种适应机制，以应对分布偏移，而无需针对未见数据重新训练。实验结果表明，FAID在多个基线模型上表现更优，特别是在未见领域和新大语言模型上的泛化准确性显著提升，从而为增强AI辅助写作的透明度和可问责性提供了一个潜在的解决方案。我们的数据和代码公开于 https://github.com/mbzuai-nlp/FAID。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

【NeurIPS2025】DNA-DetectLLM：基于 DNA 启发的“突变-修复”范式揭示 AI 生成文本

专知会员服务

12+阅读 · 2025年9月22日

文本、视觉与语音生成的自动化评估方法综述

专知会员服务

20+阅读 · 2025年6月15日

AI生成媒体检测综述：从非多模态大语言模型到多模态大语言模型

专知会员服务

18+阅读 · 2025年2月11日

《人工智能生成式文本检测：数据集和数据生成》最新39页报告

专知会员服务

32+阅读 · 2024年12月18日