We propose a new framework for combining entity resolution and query answering in knowledge bases (KBs) with tuple-generating dependencies (tgds) and equality-generating dependencies (egds) as rules. We define the semantics of the KB in terms of special instances that involve equivalence classes of entities and sets of values. Intuitively, the former collect all entities denoting the same real-world object, while the latter collect all alternative values for an attribute. This approach allows us to both resolve entities and bypass possible inconsistencies in the data. We then design a chase procedure that is tailored to this new framework and has the feature that it never fails; moreover, when the chase procedure terminates, it produces a universal solution, which in turn can be used to obtain the certain answers to conjunctive queries. We finally discuss challenges arising when the chase does not terminate.
翻译:我们提出了一种新框架,用于在具有元组生成依赖(tgds)和等式生成依赖(egds)作为规则的知识库(KBs)中结合实体解析与查询回答。我们基于包含实体等价类与属性值集合的特殊实例来定义知识库的语义。直观上,前者汇集指向同一真实世界对象的所有实体,后者则收集某一属性的所有备选值。该方法既能实现实体解析,又能规避数据中可能存在的矛盾。随后我们设计了一种专用于此框架的追查过程,其特点在于永不会失败;此外,当追查过程终止时,它将生成一个通用解,该解可用于获得合取查询的确定答案。最后,我们讨论了追查过程未终止时出现的挑战。