Large Language Models (LLMs), exemplified by ChatGPT, have significantly reshaped text generation, particularly in the realm of writing assistance. While ethical considerations underscore the importance of transparently acknowledging LLM use, especially in scientific communication, genuine acknowledgment remains infrequent. A potential avenue to encourage accurate acknowledging of LLM-assisted writing involves employing automated detectors. Our evaluation of four cutting-edge LLM-generated text detectors reveals their suboptimal performance compared to a simple ad-hoc detector designed to identify abrupt writing style changes around the time of LLM proliferation. We contend that the development of specialized detectors exclusively dedicated to LLM-assisted writing detection is necessary. Such detectors could play a crucial role in fostering more authentic recognition of LLM involvement in scientific communication, addressing the current challenges in acknowledgment practices.
翻译:大型语言模型(LLMs),以ChatGPT为代表,已显著重塑了文本生成,尤其是在写作辅助领域。虽然伦理考量凸显了透明标注LLM使用的重要性,尤其是在科学交流中,但真实的标注仍不常见。鼓励准确标注LLM辅助写作的一种潜在途径是采用自动检测器。我们对四种前沿的LLM生成文本检测器进行评估,发现它们的性能逊于一个简单的即席检测器——该检测器旨在识别LLM普及时期前后出现的突变写作风格。我们认为,开发专门用于检测LLM辅助写作的特化检测器十分必要。此类检测器可在促进科学交流中对LLM参与更真实地认知方面发挥关键作用,从而解决当前标注实践中的难题。