Federated PCA on Grassmann Manifold for IoT Anomaly Detection

With the proliferation of the Internet of Things (IoT) and the rising interconnectedness of devices, network security faces significant challenges, especially from anomalous activities. While traditional machine learning-based intrusion detection systems (ML-IDS) effectively employ supervised learning methods, they possess limitations such as the requirement for labeled data and challenges with high dimensionality. Recent unsupervised ML-IDS approaches such as AutoEncoders and Generative Adversarial Networks (GAN) offer alternative solutions but pose challenges in deployment onto resource-constrained IoT devices and in interpretability. To address these concerns, this paper proposes a novel federated unsupervised anomaly detection framework, FedPCA, that leverages Principal Component Analysis (PCA) and the Alternating Directions Method Multipliers (ADMM) to learn common representations of distributed non-i.i.d. datasets. Building on the FedPCA framework, we propose two algorithms, FEDPE in Euclidean space and FEDPG on Grassmann manifolds. Our approach enables real-time threat detection and mitigation at the device level, enhancing network resilience while ensuring privacy. Moreover, the proposed algorithms are accompanied by theoretical convergence rates even under a subsampling scheme, a novel result. Experimental results on the UNSW-NB15 and TON-IoT datasets show that our proposed methods offer performance in anomaly detection comparable to nonlinear baselines, while providing significant improvements in communication and memory efficiency, underscoring their potential for securing IoT networks.

翻译：随着物联网（IoT）的普及和设备互联性的日益增强，网络安全面临重大挑战，尤其是来自异常活动的威胁。虽然基于传统机器学习的入侵检测系统（ML-IDS）能有效运用监督学习方法，但其存在诸如需要标注数据和高维数据处理困难等局限性。近期无监督ML-IDS方法（如自编码器和生成对抗网络）提供了替代解决方案，但在部署到资源受限的IoT设备及可解释性方面仍存在挑战。为解决这些问题，本文提出一种新颖的联邦无监督异常检测框架FedPCA，该框架利用主成分分析（PCA）和交替方向乘子法（ADMM）来学习分布式非独立同分布数据集的共同表示。基于FedPCA框架，我们提出了两种算法：欧几里得空间中的FEDPE算法和Grassmann流形上的FEDPG算法。我们的方法支持设备级实时威胁检测与缓解，在确保隐私的同时增强网络韧性。此外，所提算法即使在子采样方案下仍具有理论收敛速率，这是一项创新性成果。在UNSW-NB15和TON-IoT数据集上的实验结果表明，我们提出的方法在异常检测性能上可与非线性基线方法相媲美，同时在通信和内存效率方面实现显著提升，凸显了其在保护IoT网络安全方面的潜力。

相关内容

PCA

关注 3

在统计中，主成分分析（PCA）是一种通过最大化每个维度的方差来将较高维度空间中的数据投影到较低维度空间中的方法。给定二维，三维或更高维空间中的点集合，可以将“最佳拟合”线定义为最小化从点到线的平均平方距离的线。可以从垂直于第一条直线的方向类似地选择下一条最佳拟合线。重复此过程会产生一个正交的基础，其中数据的不同单个维度是不相关的。这些基向量称为主成分。

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

14+阅读 · 2022年3月12日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日