We introduce a general abstract framework for database repairing that differentiates between integrity constraints and so-called query constraints. The former model consistency and desirable properties of the data (such as functional dependencies and independencies), while the latter relate two database instances according to their answers to the query constraints. The framework also admits a distinction between hard and soft queries, which makes it possible both to preserve the answers to a core set of queries and to define a distance between instances based on query answers. Finally, we present an instantiation of this framework by defining logic-based metrics in K-teams (a notion recently defined for logical modelling of relational data with semiring annotations). We exemplify how various notions of repairs from the literature can be modelled in our unifying framework.