In this short paper, we examine the main metrics used to evaluate textual coreference and we detail some of their limitations. We show that a unique score cannot represent the full complexity of the problem at stake, and is thus uninformative, or even misleading. We propose a new way of evaluating coreference, taking into account the context (in our case, the analysis of fictions, esp. novels). More specifically, we propose to distinguish long coreference chains (corresponding to main characters), from short ones (corresponding to secondary characters), and singletons (isolated elements). This way, we hope to get more interpretable and thus more informative results through evaluation.
翻译:在这篇短文中,我们考察了用于评估文本指代消解的主要指标,并详细阐述了其部分局限性。我们证明,单一的评分无法体现所涉及问题的全部复杂性,因此缺乏信息价值,甚至可能产生误导。我们提出了一种评估指代消解的新方法,该方法考虑了语境因素(在本文中,语境指对虚构作品,尤其是小说的分析)。具体而言,我们建议区分长指代链(对应主要人物)、短指代链(对应次要人物)以及单例(孤立元素)。通过这种方式,我们期望在评估中获得更具可解释性,从而也更具信息价值的分析结果。