This paper delves into enhancing the classification performance on the GoEmotions dataset, a large, manually annotated dataset for emotion detection in text. The primary goal of this paper is to address the challenges of detecting subtle emotions in text, a complex issue in Natural Language Processing (NLP) with significant practical applications. The findings offer valuable insights into addressing the challenges of emotion detection in text and suggest directions for future research, including the potential for a survey paper that synthesizes methods and performances across various datasets in this domain.
翻译:本文深入探究了在GoEmotions数据集(一个大规模人工标注的文本情感检测数据集)上提升分类性能的方法。研究的主要目标是解决文本中细微情感检测这一自然语言处理(NLP)中的复杂问题及其重要实际应用。特别地,我们对比了数据增强技术与迁移学习对大语言模型在该数据集上效果的影响。通过一系列实验,我们评估了不同方法在情感检测任务中的表现。实验结果表明,混合迁移学习与数据增强方法相比单一方法在细粒度情感识别方面具有更优的性能。研究结果为了解文本情感检测的挑战提供了宝贵见解,并为未来研究指明了方向,包括撰写总结该领域各数据集方法及性能的综述论文的潜在可能。