$ $The usage of generative artificial intelligence (AI) tools based on large language models, including ChatGPT, Bard, and Claude, for text generation has many exciting applications with the potential for phenomenal productivity gains. One issue is authorship attribution when using AI tools. This is especially important in an academic setting where the inappropriate use of generative AI tools may hinder student learning or stifle research by creating a large amount of automatically generated derivative work. Existing plagiarism detection systems can trace the source of submitted text but are not yet equipped with methods to accurately detect AI-generated text. This paper introduces the idea of direct origin detection and evaluates whether generative AI systems can recognize their output and distinguish it from human-written texts. We argue why current transformer-based models may be able to self-detect their own generated text and perform a small empirical study using zero-shot learning to investigate if that is the case. Results reveal varying capabilities of AI systems to identify their generated text. Google's Bard model exhibits the largest capability of self-detection with an accuracy of 94\%, followed by OpenAI's ChatGPT with 83\%. On the other hand, Anthropic's Claude model seems to be not able to self-detect.
翻译:基于大型语言模型的生成式人工智能(AI)工具(包括ChatGPT、Bard和Claude)在文本生成方面具有许多令人兴奋的应用,并有望带来显著的生产力提升。然而,使用AI工具时的一个关键问题是作者归属。这在学术环境中尤为重要,因为不当使用生成式AI工具可能阻碍学生学习,或通过生成大量自动衍生的作品而抑制研究创新。现有的抄袭检测系统可以追踪提交文本的来源,但尚未配备能够准确检测AI生成文本的方法。本文提出了直接来源检测的概念,并评估了生成式AI系统能否识别自身生成的内容,并将其与人类撰写的文本区分开来。我们论证了当前基于Transformer的模型为何可能具备自检测其自身生成文本的能力,并通过零样本学习进行了一项小型实证研究以验证这一假设。结果表明,不同AI系统在识别自身生成文本方面的能力存在差异。谷歌的Bard模型展现出最强的自检测能力,准确率达94%,其次是OpenAI的ChatGPT,准确率为83%。相比之下,Anthropic的Claude模型似乎无法实现自检测。