Word sense disambiguation (WSD) is one of the main challenges in Computational Linguistics. TreeMatch is a WSD system originally developed using data from SemEval 2007 Task 7 (Coarse-grained English All-words Task) that has been adapted for use in SemEval 2010 Task 17 (All-words Word Sense Disambiguation on a Specific Domain). The system is based on a fully unsupervised method using dependency knowledge drawn from a domain specific knowledge base that was built for this task. When evaluated on the task, the system precision performs above the Most Frequent Selection baseline.
翻译:词义消歧(WSD)是计算语言学的主要挑战之一。TreeMatch是一个最初基于SemEval 2007 Task 7(粗粒度英语全词任务)数据开发的WSD系统,现已适配应用于SemEval 2010 Task 17(特定领域全词词义消歧任务)。该系统采用完全无监督方法,其核心是利用针对该任务构建的领域特定知识库所提取的依存知识。在任务评估中,该系统精度表现优于最频繁选择基线。