Deep Efficient Private Neighbor Generation for Subgraph Federated Learning

Behemoth graphs are often fragmented and separately stored by multiple data owners as distributed subgraphs in many realistic applications. Without harming data privacy, it is natural to consider the subgraph federated learning (subgraph FL) scenario, where each local client holds a subgraph of the entire global graph, to obtain globally generalized graph mining models. To overcome the unique challenge of incomplete information propagation on local subgraphs due to missing cross-subgraph neighbors, previous works resort to the augmentation of local neighborhoods through the joint FL of missing neighbor generators and GNNs. Yet their technical designs have profound limitations regarding the utility, efficiency, and privacy goals of FL. In this work, we propose FedDEP to comprehensively tackle these challenges in subgraph FL. FedDEP consists of a series of novel technical designs: (1) Deep neighbor generation through leveraging the GNN embeddings of potential missing neighbors; (2) Efficient pseudo-FL for neighbor generation through embedding prototyping; and (3) Privacy protection through noise-less edge-local-differential-privacy. We analyze the correctness and efficiency of FedDEP, and provide theoretical guarantees on its privacy. Empirical results on four real-world datasets justify the clear benefits of proposed techniques.

翻译：在众多实际应用中，庞大图数据常被多个数据拥有者以分布式子图形式碎片化存储。在不损害数据隐私的前提下，自然考虑采用子图联邦学习（子图联邦学习）场景——每个本地客户端持有全局图的子图——以获得全局泛化的图挖掘模型。针对跨子图邻居缺失导致本地子图信息传播不完整的独特挑战，先前研究通过联合学习缺失邻居生成器与图神经网络（GNN）来增强本地邻域。但其技术设计在联邦学习的效用、效率与隐私目标方面存在显著局限性。本研究提出FedDEP以全面应对子图联邦学习中的这些挑战。FedDEP包含一系列创新技术设计：(1) 通过利用潜在缺失邻居的GNN嵌入进行深度邻居生成；(2) 通过嵌入原型化实现高效伪联邦邻居生成；(3) 通过无噪声边-本地-差分隐私实现隐私保护。我们分析了FedDEP的正确性与效率，并提供了其隐私的理论保证。在四个真实数据集上的实验结果充分证明了所提技术的显著优势。

相关内容

联邦学习

关注 200

联邦学习（Federated Learning）是一种新兴的人工智能基础技术，在 2016 年由谷歌最先提出，原本用于解决安卓手机终端用户在本地更新模型的问题，其设计目标是在保障大数据交换时的信息安全、保护终端数据和个人数据隐私、保证合法合规的前提下，在多参与方或多计算结点之间开展高效率的机器学习。其中，联邦学习可使用的机器学习算法不局限于神经网络，还包括随机森林等重要算法。联邦学习有望成为下一代人工智能协同算法和协作网络的基础。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日