The advancements of Large Language Models (LLMs) have spurred a growing interest in their application to Named Entity Recognition (NER) methods. However, existing datasets are primarily designed for traditional machine learning methods and are inadequate for LLM-based methods, in terms of corpus selection and overall dataset design logic. Moreover, the prevalent fixed and relatively coarse-grained entity categorization in existing datasets fails to adequately assess the superior generalization and contextual understanding capabilities of LLM-based methods, thereby hindering a comprehensive demonstration of their broad application prospects. To address these limitations, we propose DynamicNER, the first NER dataset designed for LLM-based methods with dynamic categorization, introducing various entity types and entity type lists for the same entity in different context, leveraging the generalization of LLM-based NER better. The dataset is also multilingual and multi-granular, covering 8 languages and 155 entity types, with corpora spanning a diverse range of domains. Furthermore, we introduce CascadeNER, a novel NER method based on a two-stage strategy and lightweight LLMs, achieving higher accuracy on fine-grained tasks while requiring fewer computational resources. Experiments show that DynamicNER serves as a robust and effective benchmark for LLM-based NER methods. Furthermore, we also conduct analysis for traditional methods and LLM-based methods on our dataset. Our code and dataset are openly available at https://github.com/Astarojth/DynamicNER.
翻译:大型语言模型(LLM)的进展激发了人们将其应用于命名实体识别(NER)方法的日益增长的兴趣。然而,现有数据集主要针对传统机器学习方法设计,在语料选择和整体数据集设计逻辑方面,对于基于LLM的方法而言存在不足。此外,现有数据集中普遍存在的固定且相对粗粒度的实体分类,未能充分评估基于LLM的方法在泛化能力和上下文理解方面的优越性,从而阻碍了全面展示其广泛的应用前景。为应对这些局限性,我们提出了DynamicNER,这是首个专为基于LLM的方法设计的、具有动态分类的NER数据集,通过为同一实体在不同上下文中引入多种实体类型和实体类型列表,更好地利用了基于LLM的NER的泛化能力。该数据集同时具备多语言和多粒度特性,涵盖8种语言和155种实体类型,语料覆盖多个不同领域。此外,我们提出了CascadeNER,一种基于两阶段策略和轻量级LLM的新型NER方法,在细粒度任务上实现了更高的准确率,同时所需计算资源更少。实验表明,DynamicNER为基于LLM的NER方法提供了一个稳健且有效的基准。此外,我们还在我们的数据集上对传统方法和基于LLM的方法进行了分析。我们的代码和数据集已在 https://github.com/Astarojth/DynamicNER 公开提供。