Assessing the potential of AI-assisted pragmatic annotation: The case of apologies

Certain forms of linguistic annotation, like part of speech and semantic tagging, can be automated with high accuracy. However, manual annotation is still necessary for complex pragmatic and discursive features that lack a direct mapping to lexical forms. This manual process is time-consuming and error-prone, limiting the scalability of function-to-form approaches in corpus linguistics. To address this, our study explores automating pragma-discursive corpus annotation using large language models (LLMs). We compare ChatGPT, the Bing chatbot, and a human coder in annotating apology components in English based on the local grammar framework. We find that the Bing chatbot outperformed ChatGPT, with accuracy approaching that of a human coder. These results suggest that AI can be successfully deployed to aid pragma-discursive corpus annotation, making the process more efficient and scalable. Keywords: linguistic annotation, function-to-form approaches, large language models, local grammar analysis, Bing chatbot, ChatGPT

翻译：某些形式的语言标注，如词性标注和语义标注，可以以高准确度实现自动化。然而，对于缺乏直接词汇映射的复杂语用和话语特征，人工标注仍然是必要的。这一手动过程耗时且容易出错，限制了语料库语言学中功能-形式方法的可扩展性。为解决这一问题，本研究探索使用大语言模型自动化语用语篇语料库标注。我们比较了ChatGPT、必应聊天机器人以及人工编码员在基于局部语法框架标注英语道歉成分方面的表现。研究发现，必应聊天机器人的表现优于ChatGPT，其准确度接近人工编码员。这些结果表明，人工智能可以成功应用于辅助语用语篇语料库标注，使过程更加高效且可扩展。关键词：语言标注、功能-形式方法、大语言模型、局部语法分析、必应聊天机器人、ChatGPT

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日