Previous versions of sparse principal component analysis (PCA) have presumed that the eigen-basis (a $p \times k$ matrix) is approximately sparse. We propose a method that presumes the $p \times k$ matrix becomes approximately sparse after a $k \times k$ rotation. The simplest version of the algorithm initializes with the leading $k$ principal components. Then, the principal components are rotated with an $k \times k$ orthogonal rotation to make them approximately sparse. Finally, soft-thresholding is applied to the rotated principal components. This approach differs from prior approaches because it uses an orthogonal rotation to approximate a sparse basis. One consequence is that a sparse component need not to be a leading eigenvector, but rather a mixture of them. In this way, we propose a new (rotated) basis for sparse PCA. In addition, our approach avoids "deflation" and multiple tuning parameters required for that. Our sparse PCA framework is versatile; for example, it extends naturally to a two-way analysis of a data matrix for simultaneous dimensionality reduction of rows and columns. We provide evidence showing that for the same level of sparsity, the proposed sparse PCA method is more stable and can explain more variance compared to alternative methods. Through three applications -- sparse coding of images, analysis of transcriptome sequencing data, and large-scale clustering of social networks, we demonstrate the modern usefulness of sparse PCA in exploring multivariate data.
翻译:先前版本的稀疏主成分分析(PCA)假定特征基(一个 $p \times k$ 矩阵)近似稀疏。我们提出一种方法,假定该 $p \times k$ 矩阵在经过 $k \times k$ 旋转后变得近似稀疏。该算法的最简版本初始化为前 $k$ 个主成分。随后,通过一个 $k \times k$ 正交旋转使主成分近似稀疏。最后,对旋转后的主成分应用软阈值处理。该方法与先前方法不同,因为它利用正交旋转逼近稀疏基。其一个结果是,稀疏成分不必是主导特征向量,而是它们的混合。由此,我们提出了一种新的(旋转后的)稀疏PCA基。此外,我们的方法避免了“收缩”及其所需的多重调优参数。我们的稀疏PCA框架具有普适性;例如,它自然地扩展到数据矩阵的双向分析,以同时降低行和列的维度。我们提供的证据表明,在相同稀疏度水平下,与替代方法相比,所提出的稀疏PCA方法更稳定,并能解释更多的方差。通过三个应用——图像的稀疏编码、转录组测序数据分析以及社交网络的大规模聚类,我们展示了稀疏PCA在探索多变量数据中的现代实用性。