Open Information Extraction (OIE) aims to extract factual relational tuples from open-domain sentences. Downstream tasks treat the extracted OIE tuples as facts without examining their certainty. However, uncertainty, or speculation, is a common linguistic phenomenon. Existing studies define speculation detection at the sentence level, but even when a sentence is judged speculative, not every tuple extracted from it is necessarily speculative. In this paper, we propose to study speculation in OIE and aim to determine whether an extracted tuple is speculative. We formally define the research problem of tuple-level speculation detection and conduct a detailed data analysis on the LSOIE dataset, which contains labels for speculative tuples. Lastly, we propose a baseline model, OIE-Spec, for this new research task.