Exploring the Feasibility of ChatGPT for Event Extraction

Event extraction is a fundamental task in natural language processing that involves identifying and extracting information about events mentioned in text. However, it is a challenging task due to the lack of annotated data, which is expensive and time-consuming to obtain. The emergence of large language models (LLMs) such as ChatGPT provides an opportunity to solve language tasks with simple prompts without the need for task-specific datasets and fine-tuning. While ChatGPT has demonstrated impressive results in tasks like machine translation, text summarization, and question answering, it presents challenges when used for complex tasks like event extraction. Unlike other tasks, event extraction requires the model to be provided with a complex set of instructions defining all event types and their schemas. To explore the feasibility of ChatGPT for event extraction and the challenges it poses, we conducted a series of experiments. Our results show that ChatGPT has, on average, only 51.04% of the performance of a task-specific model such as EEQA in long-tail and complex scenarios. Our usability testing experiments indicate that ChatGPT is not robust enough, and continuous refinement of the prompt does not lead to stable performance improvements, which can result in a poor user experience. Besides, ChatGPT is highly sensitive to different prompt styles.

翻译：事件抽取是自然语言处理中的一项基础任务，旨在识别并提取文本中提及的事件信息。然而，由于标注数据获取成本高昂且耗时，该任务面临标注数据匮乏的挑战。以ChatGPT为代表的大语言模型的出现，为通过简单提示解决语言任务提供了可能，无需专门的数据集和微调。尽管ChatGPT在机器翻译、文本摘要和问答等任务中表现出色，但在事件抽取这类复杂任务中仍面临挑战。与其他任务不同，事件抽取要求模型配备定义所有事件类型及其模式的一套复杂指令集。为探究ChatGPT在事件抽取中的可行性及其面临的挑战，我们开展了一系列实验。结果表明，在长尾和复杂场景下，ChatGPT的平均性能仅为EEQA等任务专用模型的51.04%。可用性测试实验表明，ChatGPT的鲁棒性不足，持续优化提示词无法带来稳定的性能提升，这可能导致用户体验不佳。此外，ChatGPT对不同提示风格高度敏感。

相关内容

ChatGPT

关注 258

ChatGPT（全名：Chat Generative Pre-trained Transformer），美国OpenAI 研发的聊天机器人程序 [1] ，于2022年11月30日发布。ChatGPT是人工智能技术驱动的自然语言处理工具，它能够通过学习和理解人类的语言来进行对话，还能根据聊天的上下文进行互动，真正像人类一样来聊天交流，甚至能完成撰写邮件、视频脚本、文案、翻译、代码，写论文任务。 [1] https://openai.com/blog/chatgpt/

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【2020关键词提取】医学报告的关键词提取和结构化，Keyword extraction and structuralization of medical reports

专知会员服务

33+阅读 · 2020年5月2日