"mdendro" is an R package that provides a comprehensive collection of linkage methods for agglomerative hierarchical clustering on a matrix of proximity data (distances or similarities), returning a multifurcated dendrogram or multidendrogram. Multidendrograms can group more than two clusters at the same time, solving the nonuniqueness problem that arises when there are ties in the data. This problem causes that different binary dendrograms are possible depending both on the order of the input data and on the criterion used to break ties. Weighted and unweighted versions of the most common linkage methods are included in the package, which also implements two parametric linkage methods. In addition, package "mdendro" provides five descriptive measures to analyze the resulting dendrograms: cophenetic correlation coefficient, space distortion ratio, agglomeration coefficient, chaining coefficient and tree balance.
翻译:“mdendro”是一个R包,它为邻近数据矩阵(距离或相似度)的凝聚层次聚类提供了全面的连接方法集合,可返回多分叉树状图或多重树状图。多重树状图能够同时将两个以上的簇进行分组,从而解决数据中存在重复值时产生的非唯一性问题。该问题会导致不同的二叉树状图可能产生,具体取决于输入数据的顺序以及用于处理重复值的标准。该软件包包含了最常见连接方法的加权和非加权版本,同时还实现了两种参数化连接方法。此外,“mdendro”包提供了五种描述性度量来分析生成的树状图:同表型相关系数、空间畸变比、凝聚系数、链式系数和树平衡度。