The delimitation of biological species, i.e., deciding which individuals belong to the same species and whether and how many different species are represented in a data set, is key to the conservation of biodiversity. Much existing work uses only genetic data for species delimitation, often employing some kind of cluster analysis. This can be misleading, because geographically distant groups of individuals can be genetically quite different even if they belong to the same species. This paper investigates the problem of testing whether two potentially separated groups of individuals can belong to a single species or not based on genetic and spatial data. Various approaches are compared (some of which already exist in the literature) based on simulated metapopulations generated with SLiM and GSpace - two software packages that can simulate spatially-explicit genetic data at an individual level. Approaches involve partial Mantel testing, maximum likelihood mixed-effects models with a population effect, and jackknife-based homogeneity tests. A key challenge is that most tests perform on genetic and geographical distance data, violating standard independence assumptions. Simulations showed that partial Mantel tests and mixed-effects models have larger power than jackknife-based methods, but tend to display type-I-error rates slightly above the significance level. Moreover, a multiple regression model neglecting the dependence in the dissimilarities did not show inflated type-I-error rate. An application on brassy ringlets concludes the paper.
翻译:生物物种的界定——即确定哪些个体属于同一物种,以及数据集是否包含及包含多少不同物种——是生物多样性保护的关键。现有研究多仅依赖遗传数据进行物种界定,通常采用某种聚类分析方法。这种方法可能产生误导,因为即使属于同一物种,地理上相隔较远的个体群在遗传上也可能存在显著差异。本文研究如何基于遗传与空间数据判断两个可能分离的个体群是否属于同一物种的问题。基于SLiM和GSpace(两款可在个体层面模拟空间显式遗传数据的软件包)生成的模拟集合种群,本文比较了多种方法(其中部分已有文献记载)。具体方法包括偏Mantel检验、含群体效应的最大似然混合效应模型,以及基于刀切法的同质性检验。核心挑战在于大多数检验基于遗传与地理距离数据进行,违反了标准独立性假设。模拟结果显示:偏Mantel检验和混合效应模型的统计功效高于基于刀切法的方法,但其第一类错误率略高于显著性水平。此外,忽略相异度间依赖性的多元回归模型并未表现出膨胀的第一类错误率。本文最后以铜色环蝶的应用实例作为总结。