We present the findings of SemEval-2023 Task 2 on Fine-grained Multilingual Named Entity Recognition (MultiCoNER 2). Divided into 13 tracks, the task focused on methods to identify complex fine-grained named entities (like WRITTENWORK, VEHICLE, MUSICALGRP) across 12 languages, in both monolingual and multilingual scenarios, as well as noisy settings. The task used the MultiCoNER V2 dataset, composed of 2.2 million instances in Bangla, Chinese, English, Farsi, French, German, Hindi, Italian., Portuguese, Spanish, Swedish, and Ukrainian. MultiCoNER 2 was one of the most popular tasks of SemEval-2023. It attracted 842 submissions from 47 teams, and 34 teams submitted system papers. Results showed that complex entity types such as media titles and product names were the most challenging. Methods fusing external knowledge into transformer models achieved the best performance, and the largest gains were on the Creative Work and Group classes, which are still challenging even with external knowledge. Some fine-grained classes proved to be more challenging than others, such as SCIENTIST, ARTWORK, and PRIVATECORP. We also observed that noisy data has a significant impact on model performance, with an average drop of 10% on the noisy subset. The task highlights the need for future research on improving NER robustness on noisy data containing complex entities.
翻译:我们介绍了SemEval-2023任务2——细粒度多语言命名实体识别(MultiCoNER 2)的研究结果。该任务划分为13个子赛道,聚焦于在单语和多语言场景及噪声环境下,识别12种语言中复杂细粒度命名实体(如WRITTENWORK、VEHICLE、MUSICALGRP)的方法。任务采用MultiCoNER V2数据集,包含孟加拉语、中文、英语、波斯语、法语、德语、印地语、意大利语、葡萄牙语、西班牙语、瑞典语和乌克兰语共220万个实例。MultiCoNER 2是SemEval-2023最受欢迎的任务之一,吸引了47个团队的842份提交,其中34个团队提交了系统论文。结果表明,媒体标题和产品名称等复杂实体类型最具挑战性。将外部知识融入Transformer模型的方法取得了最佳性能,在创意作品(Creative Work)和团体(Group)类别上提升最为显著,但即使借助外部知识,这些类别仍具挑战性。部分细粒度类别(如SCIENTIST、ARTWORK、PRIVATECORP)的难度高于其他类别。我们还观察到噪声数据对模型性能有显著影响,在噪声子集上平均性能下降10%。该任务凸显了未来需提升NER在含复杂实体的噪声数据中的鲁棒性。