ChatGPT has gained significant interest due to its impressive performance, but people are increasingly concerned about its potential risks, particularly around the detection of AI-generated content (AIGC), which is often difficult for untrained humans to identify. Current datasets utilized for detecting ChatGPT-generated text primarily center around question-answering, yet they tend to disregard tasks that possess semantic-invariant properties, such as summarization, translation, and paraphrasing. Our primary studies demonstrate that detecting model-generated text on semantic-invariant tasks is more difficult. To fill this gap, we introduce a more extensive and comprehensive dataset that considers more types of tasks than previous work, including semantic-invariant tasks. In addition, the model after a large number of task instruction fine-tuning shows a strong powerful performance. Owing to its previous success, we further instruct fine-tuning Tk-instruct and built a more powerful detection system. Experimental results show that our proposed detector outperforms the previous state-of-the-art RoBERTa-based detector.
翻译:ChatGPT因其卓越性能引起了广泛关注,但人们对其潜在风险日益担忧,尤其是在人工智能生成内容(AIGC)的检测方面——这一内容通常未经训练的普通人难以识别。当前用于检测ChatGPT生成文本的数据集主要聚焦于问答任务,却往往忽视了具有语义不变属性的任务(如摘要、翻译和改写)。我们的初步研究表明,在语义不变任务中检测模型生成文本更为困难。为弥补这一空白,我们引入了一个更广泛、更全面的数据集,涵盖了比先前工作更多类型的任务,包括语义不变任务。此外,经过大量任务指令微调后的模型展现出强大的性能。鉴于其先前取得的成功,我们进一步对Tk-instruct进行了指令微调,构建了一个更强大的检测系统。实验结果表明,我们提出的检测器优于先前基于RoBERTa的最先进检测器。