This paper presents and evaluates a set of methods to classify individual Scopus publications using their references back to the second generation, where each publication can be assigned fractionally into up to five ASJC (All Science Journal Classifications) categories, excluding the Multidisciplinary area and the miscellaneous categories. Based on proposals by Glanzel et al. (1999a, 1999b, 2021), some additional parameters are established that allow different results to be obtained depending on how category membership is weighted or how the acceptance thresholds for multiple assignments are established. Various classifications are obtained, and then compared with each other, with the original ASJC Scopus journal classification, and with the AAC (Authors Assignation Collection) classification of a previous study (Alvarez-Llorente et al., 2023) in which the papers corresponding authors assign them the most appropriate categories. Classifications in which a high threshold is set for allowing assignments to multiple categories, combined with the use of first- and second-generation references and averaging over the number of references, provide the most promising results, improving over other reference-based reclassification proposals in terms of granularity, and over the Scopus classification itself in such aspects as the homogeneity of the publications assigned to a category. They also show greater coincidence with the AAC classification.
翻译:本文提出并评估了一组方法,利用回溯至第二代的参考文献对Scopus单篇出版物进行分类,其中每篇出版物可被分数式地分配至最多五个ASJC(全科学期刊分类)类别,排除多学科领域及其他杂项类别。基于Glanzel等人(1999a, 1999b, 2021)的提议,本文建立了若干附加参数,这些参数允许根据类别隶属度的加权方式或多重分配接受阈值的设定方式获得不同结果。我们得到多种分类结果,随后将其相互比较,并与原始ASJC Scopus期刊分类及先前研究(Alvarez-Llorente等人,2023)中的AAC(作者分配集合)分类进行对比——后者由论文通讯作者为其指定最合适的类别。研究表明,通过设定较高的多重类别分配阈值,并结合使用第一代与第二代参考文献以及对参考文献数量取平均值的方法,可获得最具前景的结果;这些分类在粒度上优于其他基于参考文献的重分类方案,并在分配至同一类别的出版物同质性等方面胜过Scopus自身的分类。此外,它们还与AAC分类展现出更高的一致性。