Large Language Models (LLMs), exemplified by ChatGPT, have significantly reshaped text generation, particularly writing assistance. Although ethical considerations call for transparent acknowledgment of LLM use, especially in scientific communication, genuine acknowledgment remains infrequent. One way to encourage accurate acknowledgment of LLM-assisted writing is to employ automated detectors. Our evaluation of four state-of-the-art LLM-generated text detectors shows that they perform worse than a simple ad-hoc detector designed to identify abrupt changes in writing style around the time of LLM proliferation. We contend that detectors dedicated specifically to LLM-assisted writing need to be developed. Such detectors could play a crucial role in fostering more authentic recognition of LLM involvement in scientific communication, addressing current shortcomings in acknowledgment practices.