The development of emotion recognition in dialogue (ERC) has been consistently hindered by the complexity of pipeline designs, leading to ERC models that often overfit to specific datasets and dialogue patterns. In this study, we propose a novel approach, namely InstructERC, to reformulates the ERC task from a discriminative framework to a generative framework based on Large Language Models (LLMs) . InstructERC has two significant contributions: Firstly, InstructERC introduces a simple yet effective retrieval template module, which helps the model explicitly integrate multi-granularity dialogue supervision information by concatenating the historical dialog content, label statement, and emotional domain demonstrations with high semantic similarity. Furthermore, we introduce two additional emotion alignment tasks, namely speaker identification and emotion prediction tasks, to implicitly model the dialogue role relationships and future emotional tendencies in conversations. Our LLM-based plug-and-play plugin framework significantly outperforms all previous models and achieves comprehensive SOTA on three commonly used ERC datasets. Extensive analysis of parameter-efficient and data-scaling experiments provide empirical guidance for applying InstructERC in practical scenarios. Our code will be released after blind review.
翻译:对话情感识别(ERC)的发展长期受制于流水线设计的复杂性,导致ERC模型常过度拟合特定数据集和对话模式。本研究提出名为InstructERC的新方法,将ERC任务从判别式框架重构为基于大型语言模型(LLMs)的生成式框架。InstructERC有两项重要贡献:首先,引入简单高效的检索模板模块,通过拼接历史对话内容、标签陈述及高语义相似度的情感领域示例,帮助模型显式融合多粒度对话监督信息。其次,引入说话人识别与情感预测两项额外情感对齐任务,隐式建模对话中的角色关系与未来情感倾向。我们基于LLM的即插即用插件框架显著超越所有现有模型,在三个常用ERC数据集上实现全面最优性能。参数高效与数据扩展实验的广泛分析为InstructERC的实际应用提供了经验指导。我们的代码将在盲审后公开。