Zero-shot entity linking (EL) aims at aligning entity mentions to unseen entities to challenge the generalization ability. Previous methods largely focus on the candidate retrieval stage and ignore the essential candidate ranking stage, which disambiguates among entities and makes the final linking prediction. In this paper, we propose a read-and-select (ReS) framework by modeling the main components of entity disambiguation, i.e., mention-entity matching and cross-entity comparison. First, for each candidate, the reading module leverages mention context to output mention-aware entity representations, enabling mention-entity matching. Then, in the selecting module, we frame the choice of candidates as a sequence labeling problem, and all candidate representations are fused together to enable cross-entity comparison. Our method achieves the state-of-the-art performance on the established zero-shot EL dataset ZESHEL with a 2.55% micro-average accuracy gain, with no need for laborious multi-phase pre-training used in most of the previous work, showing the effectiveness of both mention-entity and cross-entity interaction.
翻译:零样本实体链接旨在将实体提及与未见实体对齐,以挑战模型的泛化能力。现有方法多聚焦于候选检索阶段,而忽略了关键的候选排序阶段——该阶段通过实体消歧完成最终链接预测。本文提出一种“阅读-选择”框架,通过对实体消歧的核心组件(即提及-实体匹配与跨实体比较)进行建模。首先,在每个候选实体中,阅读模块利用提及上下文输出感知提及的实体表征,从而实现提及-实体匹配。随后,在选择模块中,我们将候选选择问题建模为序列标注任务,通过融合所有候选实体表征实现跨实体比较。本方法在标准零样本实体链接数据集ZESHEL上取得了当前最优性能,微平均准确率提升2.55%,且无需大多数先前工作中所需的多阶段预训练,充分验证了提及-实体交互与跨实体交互的有效性。