ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks

From arXiv: Gilardi, Fabrizio, Meysam Alizadeh, and Maël Kubli. 2023. "ChatGPT Outperforms Crowd Workers for Text-Annotation Tasks". Proceedings of the National Academy of Sciences 120(30): e2305016120.

Many NLP applications require manual data annotations for a variety of tasks, notably to train classifiers or evaluate the performance of unsupervised models. Depending on the size and degree of complexity, the tasks may be conducted by crowd-workers on platforms such as MTurk as well as trained annotators, such as research assistants. Using a sample of 2,382 tweets, we demonstrate that ChatGPT outperforms crowd-workers for several annotation tasks, including relevance, stance, topics, and frames detection. Specifically, the zero-shot accuracy of ChatGPT exceeds that of crowd-workers for four out of five tasks, while ChatGPT's intercoder agreement exceeds that of both crowd-workers and trained annotators for all tasks. Moreover, the per-annotation cost of ChatGPT is less than $0.003 -- about twenty times cheaper than MTurk. These results show the potential of large language models to drastically increase the efficiency of text classification.
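The two quantities compared in the abstract, accuracy and intercoder agreement, can be illustrated with a minimal sketch. It assumes accuracy means the share of items whose label matches a reference label, and that intercoder agreement means the share of items given the same label by two annotation runs; the abstract does not spell out either definition. The function names and toy labels below are hypothetical and are not data from the paper.

```python
from typing import Sequence


def _share_equal(a: Sequence[str], b: Sequence[str]) -> float:
    """Fraction of positions at which two label sequences agree."""
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def accuracy(predicted: Sequence[str], gold: Sequence[str]) -> float:
    """Assumed definition: share of items matching the reference label."""
    return _share_equal(predicted, gold)


def percent_agreement(coder_a: Sequence[str], coder_b: Sequence[str]) -> float:
    """Assumed definition: share of items both annotators label identically."""
    return _share_equal(coder_a, coder_b)


# Hypothetical toy labels for a relevance task (not taken from the paper).
gold = ["relevant", "irrelevant", "relevant", "relevant"]
run_1 = ["relevant", "irrelevant", "irrelevant", "relevant"]
run_2 = ["relevant", "irrelevant", "relevant", "relevant"]

print(accuracy(run_1, gold))            # run_1 matches gold on 3 of 4 items
print(percent_agreement(run_1, run_2))  # the two runs agree on 3 of 4 items
```

Raw percent agreement does not correct for agreement expected by chance; a chance-corrected statistic such as Cohen's kappa is a common alternative when label distributions are skewed.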

