Improved Privacy-Preserving PCA Using Space-optimized Homomorphic Matrix Multiplication

Principal Component Analysis (PCA) is a pivotal technique in the fields of machine learning and data analysis. It aims to reduce the dimensionality of a dataset while minimizing the loss of information. In recent years, there have been endeavors to utilize homomorphic encryption in privacy-preserving PCA algorithms. These approaches commonly employ a PCA routine known as PowerMethod, which takes the covariance matrix as input and generates an approximate eigenvector corresponding to the primary component of the dataset. However, their performance and accuracy are constrained by the incapability of homomorphic covariance matrix computation and the absence of a universal vector normalization strategy for the PowerMethod algorithm. In this study, we propose a novel approach to privacy-preserving PCA that addresses these limitations, resulting in superior efficiency, accuracy, and scalability compared to previous approaches. We attain such efficiency and precision through the following contributions: (i) We implement space optimization techniques for a homomorphic matrix multiplication method (Jiang et al., SIGSAC 2018), making it less prone to memory saturation in parallel computation scenarios. (ii) Leveraging the benefits of this optimized matrix multiplication, we devise an efficient homomorphic circuit for computing the covariance matrix homomorphically. (iii) Utilizing the covariance matrix, we develop a novel and efficient homomorphic circuit for the PowerMethod that incorporates a universal homomorphic vector normalization strategy to enhance both its accuracy and practicality.

翻译：主成分分析（PCA）是机器学习和数据分析领域中的关键技术，旨在降低数据集维度的同时最小化信息损失。近年来，已有研究尝试在同态加密的隐私保护PCA算法中应用该技术。这些方法通常采用称为PowerMethod的PCA流程，该流程以协方差矩阵为输入，生成与数据集主成分对应的近似特征向量。然而，由于同态协方差矩阵计算的不可行性以及PowerMethod算法缺乏通用向量归一化策略，其性能与精度受到限制。本研究提出一种新颖的隐私保护PCA方法，通过克服上述限制，在效率、精度和可扩展性方面均优于先前方法。我们通过以下贡献实现高效性与精确性：（i）针对同态矩阵乘法方法（Jiang等，SIGSAC 2018）实现空间优化技术，降低其在并行计算场景中的内存饱和风险；（ii）利用优化后的矩阵乘法优势，设计高效的协方差矩阵同态计算电路；（iii）基于该协方差矩阵，开发新颖高效的PowerMethod同态电路，并引入通用同态向量归一化策略以提升其精度与实用性。

相关内容

PCA

关注 3

在统计中，主成分分析（PCA）是一种通过最大化每个维度的方差来将较高维度空间中的数据投影到较低维度空间中的方法。给定二维，三维或更高维空间中的点集合，可以将“最佳拟合”线定义为最小化从点到线的平均平方距离的线。可以从垂直于第一条直线的方向类似地选择下一条最佳拟合线。重复此过程会产生一个正交的基础，其中数据的不同单个维度是不相关的。这些基向量称为主成分。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

专知会员服务

129+阅读 · 2020年11月20日