In this paper, we evaluate the extent of undisclosed LLM-generated content in texts from the parliaments of the United Kingdom and Sweden. In many areas, such as in journalism or in academic writing, there are often requirements to clearly disclose whether AI tools, such as LLMs, have been used. In the case of parliamentary texts, the guidelines on disclosure of AI use are more vague. However, in order to maintain transparency and retain public trust, it is generally recommended that parliamentarians should state whether or not they have used AI when writing texts, such as parliamentary motions. Here, we train an interpretable (glass-box) text classifier using pre-LLM parliamentary texts and LLM-generated versions of such texts. We then apply the classifier to a test set containing recent parliamentary texts, finding a steady increase in undisclosed LLM use, in both parliaments, from 2022 onwards.
翻译:本文评估了英国和瑞典议会文本中未公开的大型语言模型(LLM)生成内容的程度。在新闻或学术写作等多个领域,通常要求明确披露是否使用了LLM等人工智能工具。就议会文本而言,关于AI使用披露的指导方针较为模糊。然而,为保持透明度和公众信任,普遍建议议员在撰写议会动议等文本时说明是否使用了AI。本文利用LLM出现前的议会文本及由LLM生成的文本版本,训练了一个可解释的(玻璃箱式)文本分类器。随后将该分类器应用于包含近期议会文本的测试集,发现自2022年起,两国议会中未公开的LLM使用情况均呈稳步上升趋势。