Large Language Models (LLMs), exemplified by ChatGPT, have significantly reshaped text generation, particularly in the realm of writing assistance. While ethical considerations underscore the importance of transparently acknowledging LLM use, especially in scientific communication, genuine acknowledgment remains infrequent. A potential avenue to encourage accurate acknowledging of LLM-assisted writing involves employing automated detectors. Our evaluation of four cutting-edge LLM-generated text detectors reveals their suboptimal performance compared to a simple ad-hoc detector designed to identify abrupt writing style changes around the time of LLM proliferation. We contend that the development of specialized detectors exclusively dedicated to LLM-assisted writing detection is necessary. Such detectors could play a crucial role in fostering more authentic recognition of LLM involvement in scientific communication, addressing the current challenges in acknowledgment practices.
翻译:以ChatGPT为代表的大型语言模型(LLMs)已显著重塑了文本生成领域,尤其在写作辅助方面。尽管伦理考量强调了在科学交流中透明承认LLM使用的重要性,但真实的承认行为仍不常见。促进准确承认LLM辅助写作的一个潜在途径是采用自动检测器。我们对四种前沿LLM生成文本检测器的评估表明,相较于专门为识别LLM普及时期写作风格突变而设计的简易临时检测器,现有检测器性能欠佳。我们认为有必要开发专门用于检测LLM辅助写作的特化检测器。此类检测器可在促进科学交流中更真实地承认LLM参与方面发挥关键作用,从而应对当前学术承认实践中的挑战。