Record Linkage is the process of identifying and unifying records from various independent data sources. Existing strategies, which can be either deterministic or probabilistic, often fail to link records satisfactorily under uncertainty. This paper describes an indigenously (locally) developed fuzzy linkage method, based on fuzzy set techniques, which can effectively account for this uncertainty prevalent in the disparate data sources and address the shortcomings of the existing approaches. Extensive testing, evaluation and comparisons have demonstrated the efficacy of this fuzzy approach for record linkages.
翻译:记录链接是从多个独立数据源中识别并统一记录的过程。现有策略(包括确定性方法和概率方法)在不确定性条件下往往无法令人满意地完成记录链接。本文描述了一种基于模糊集技术的本土化(本地化)模糊链接方法,该方法能有效应对不同数据源中普遍存在的不确定性,并弥补现有方法的不足。广泛的测试、评估与比较验证了该模糊方法在记录链接中的有效性。