Developing interpretable machine learning models has become an increasingly important issue. One way in which data scientists have been able to develop interpretable models has been to use dimension reduction techniques. In this paper, we examine several dimension reduction techniques including two recent approaches developed in the network psychometrics literature called exploratory graph analysis (EGA) and unique variable analysis (UVA). We compared EGA and UVA with two other dimension reduction techniques common in the machine learning literature (principal component analysis and independent component analysis) as well as no reduction to the variables real data. We show that EGA and UVA perform as well as the other reduction techniques or no reduction. Consistent with previous literature, we show that dimension reduction can decrease, increase, or provide the same accuracy as no reduction of variables. Our tentative results find that dimension reduction tends to lead to better performance when used for classification tasks.
翻译:开发可解释的机器学习模型已成为一个日益重要的问题。数据科学家能够开发可解释模型的一种方式是使用降维技术。在本文中,我们研究了多种降维技术,包括网络心理测量学文献中提出的两种最新方法:探索性图分析(EGA)和唯一变量分析(UVA)。我们将EGA和UVA与机器学习文献中常见的另外两种降维技术(主成分分析和独立成分分析)以及不对真实数据变量进行降维的情况进行了比较。研究表明,EGA和UVA与其他降维技术或不降维方法表现相当。与先前文献一致,我们发现降维可能降低、提高或保持与不降维相同的准确性。初步结果表明,在分类任务中,降维往往能带来更好的性能。