Can ChatGPT Reproduce Human-Generated Labels? A Study of Social Computing Tasks

The release of ChatGPT has uncovered a range of possibilities whereby large language models (LLMs) can substitute human intelligence. In this paper, we seek to understand whether ChatGPT has the potential to reproduce human-generated label annotations in social computing tasks. Such an achievement could significantly reduce the cost and complexity of social computing research. As such, we use ChatGPT to relabel five seminal datasets covering stance detection (2x), sentiment analysis, hate speech, and bot detection. Our results highlight that ChatGPT does have the potential to handle these data annotation tasks, although a number of challenges remain. ChatGPT obtains an average accuracy 0.609. Performance is highest for the sentiment analysis dataset, with ChatGPT correctly annotating 64.9% of tweets. Yet, we show that performance varies substantially across individual labels. We believe this work can open up new lines of analysis and act as a basis for future research into the exploitation of ChatGPT for human annotation tasks.

翻译：ChatGPT的发布揭示了大型语言模型（LLMs）替代人类智能的多种可能性。本文旨在探究ChatGPT是否具备在社会计算任务中复现人类生成的标签标注的潜力。此类成果有望显著降低社会计算研究的成本与复杂性。为此，我们使用ChatGPT对涵盖立场检测（2项）、情感分析、仇恨言论与机器人检测的五个经典数据集进行重新标注。结果表明，ChatGPT确实具有处理这些数据标注任务的潜力，但仍面临若干挑战。ChatGPT的平均准确率达到0.609，其中情感分析数据集的性能最优，正确标注了64.9%的推文。然而，我们发现不同标注标签的性能差异显著。我们认为，本研究可为探索ChatGPT在人类标注任务中的应用开辟新的分析方向，并为后续研究提供基础。

相关内容

ChatGPT

关注 258

ChatGPT（全名：Chat Generative Pre-trained Transformer），美国OpenAI 研发的聊天机器人程序 [1] ，于2022年11月30日发布。ChatGPT是人工智能技术驱动的自然语言处理工具，它能够通过学习和理解人类的语言来进行对话，还能根据聊天的上下文进行互动，真正像人类一样来聊天交流，甚至能完成撰写邮件、视频脚本、文案、翻译、代码，写论文任务。 [1] https://openai.com/blog/chatgpt/

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

MIT-深度学习Deep Learning State of the Art in 2020，87页ppt

专知会员服务

63+阅读 · 2020年2月17日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日