ChatGPT has become a global sensation. As ChatGPT and other Large Language Models (LLMs) emerge, concerns of misusing them in various ways increase, such as disseminating fake news, plagiarism, manipulating public opinion, cheating, and fraud. Hence, distinguishing AI-generated from human-generated becomes increasingly essential. Researchers have proposed various detection methodologies, ranging from basic binary classifiers to more complex deep-learning models. Some detection techniques rely on statistical characteristics or syntactic patterns, while others incorporate semantic or contextual information to improve accuracy. The primary objective of this study is to provide a comprehensive and contemporary assessment of the most recent techniques in ChatGPT detection. Additionally, we evaluated other AI-generated text detection tools that do not specifically claim to detect ChatGPT-generated content to assess their performance in detecting ChatGPT-generated content. For our evaluation, we have curated a benchmark dataset consisting of prompts from ChatGPT and humans, including diverse questions from medical, open Q&A, and finance domains and user-generated responses from popular social networking platforms. The dataset serves as a reference to assess the performance of various techniques in detecting ChatGPT-generated content. Our evaluation results demonstrate that none of the existing methods can effectively detect ChatGPT-generated content.
翻译:ChatGPT已成为全球现象。随着ChatGPT及其他大型语言模型(LLMs)的出现,人们对以各种方式滥用它们的担忧与日俱增,例如传播假新闻、剽窃、操纵舆论、作弊和欺诈。因此,区分AI生成内容与人类生成内容变得愈发重要。研究人员提出了多种检测方法,从基础的二元分类器到更复杂的深度学习模型。一些检测技术依赖于统计特征或句法模式,而另一些则结合语义或上下文信息以提高准确性。本研究的主要目标是全面且最新地评估ChatGPT检测方面的最新技术。此外,我们还评估了其他未明确声称能检测ChatGPT生成内容的AI文本检测工具,以考察它们在检测ChatGPT生成内容方面的性能。在评估中,我们整理了一个基准数据集,包含来自ChatGPT和人类的提示词,涵盖医疗、开放问答和金融领域的多样化问题,以及来自热门社交网络平台的用户生成回答。该数据集作为参考,用于评估各种技术在检测ChatGPT生成内容时的性能。我们的评估结果表明,现有方法均无法有效检测ChatGPT生成的内容。