The development of emotion recognition in conversation (ERC) has been consistently hindered by the complexity of pipeline designs, leading to ERC models that often overfit to specific datasets and dialogue patterns. In this study, we propose a novel approach, InstructERC, which reformulates the ERC task from a discriminative framework to a generative framework based on Large Language Models (LLMs). InstructERC makes two significant contributions. First, it introduces a simple yet effective retrieval template module, which helps the model explicitly integrate multi-granularity dialogue supervision information by concatenating the historical dialogue content, a label statement, and emotional domain demonstrations with high semantic similarity. Second, we introduce two additional emotion alignment tasks, namely speaker identification and emotion prediction, to implicitly model the dialogue role relationships and future emotional tendencies in conversations. Our LLM-based plug-and-play framework significantly outperforms all previous models and achieves state-of-the-art (SOTA) results on three commonly used ERC datasets. Extensive analysis of parameter-efficient and data-scaling experiments provides empirical guidance for applying InstructERC in practical scenarios. Our code will be released after blind review.
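The retrieval template idea can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the prompt wording, and the bag-of-words cosine similarity (standing in for whatever semantic retriever the method actually uses) are all assumptions made for illustration. It only shows the concatenation of dialogue history, a label statement, and the most similar labeled demonstration into one generative prompt.

```python
import math
import re
from collections import Counter


def bow(text):
    """Lowercased bag-of-words counts (an assumed stand-in for a sentence embedding)."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve_demo(query, pool):
    """Return the (utterance, label) pair in the pool most similar to the query."""
    q = bow(query)
    return max(pool, key=lambda pair: cosine(q, bow(pair[0])))


def build_prompt(history, labels, pool):
    """Concatenate history, label statement, and a retrieved demonstration.

    `history` is a list of (speaker, utterance) pairs; the last pair is the
    utterance whose emotion is to be generated. The wording is hypothetical.
    """
    target_utterance = history[-1][1]
    demo_text, demo_label = retrieve_demo(target_utterance, pool)
    parts = ["Dialogue history:"]
    parts += [f"{speaker}: {utterance}" for speaker, utterance in history]
    parts.append(
        "Label statement: choose the emotion of the last utterance from ["
        + ", ".join(labels) + "]."
    )
    parts.append(f'Similar example: "{demo_text}" -> {demo_label}')
    parts.append("Emotion:")
    return "\n".join(parts)


if __name__ == "__main__":
    demo_pool = [
        ("I am so happy about the news", "joy"),
        ("This is terrible, I hate it", "anger"),
    ]
    dialogue = [
        ("A", "How was the trip?"),
        ("B", "I hate this terrible weather"),
    ]
    print(build_prompt(dialogue, ["joy", "anger", "neutral"], demo_pool))
```

In this sketch the LLM would be asked to continue the text after "Emotion:", which is what makes the formulation generative rather than a classification head over fixed labels.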