A Survey on Vertical Federated Learning: From a Layered Perspective

Vertical federated learning (VFL) is a promising category of federated learning for the scenario where data is vertically partitioned and distributed among parties. VFL enriches the description of samples using features from different parties to improve model capacity. Compared with horizontal federated learning, in most cases, VFL is applied in the commercial cooperation scenario of companies. Therefore, VFL contains tremendous business values. In the past few years, VFL has attracted more and more attention in both academia and industry. In this paper, we systematically investigate the current work of VFL from a layered perspective. From the hardware layer to the vertical federated system layer, researchers contribute to various aspects of VFL. Moreover, the application of VFL has covered a wide range of areas, e.g., finance, healthcare, etc. At each layer, we categorize the existing work and explore the challenges for the convenience of further research and development of VFL. Especially, we design a novel MOSP tree taxonomy to analyze the core component of VFL, i.e., secure vertical federated machine learning algorithm. Our taxonomy considers four dimensions, i.e., machine learning model (M), protection object (O), security model (S), and privacy-preserving protocol (P), and provides a comprehensive investigation.

翻译：纵向联邦学习（vertical federated learning, VFL）是联邦学习的一种重要范式，适用于数据按纵向划分且分布于不同参与方的场景。VFL通过整合不同参与方的特征来丰富样本描述，从而提升模型能力。相较于横向联邦学习，VFL多数情况下应用于企业间的商业合作场景，因此蕴含巨大的商业价值。近年来，VFL在学术界和工业界日益受到关注。本文从分层视角系统梳理了VFL的现有工作：从硬件层到纵向联邦系统层，研究者们对VFL的各个层面均有贡献。此外，VFL的应用已涵盖金融、医疗等广泛领域。在各层级中，我们对现有工作进行分类，并探讨其发展挑战，以期为VFL的后续研究与应用提供便利。特别地，我们设计了一种新型MOSP树分类法，用于分析VFL的核心组件——安全的纵向联邦学习算法。该分类法从机器学习模型（M）、保护对象（O）、安全模型（S）和隐私保护协议（P）四个维度展开，并进行了全面探究。

相关内容

联邦学习

关注 200

联邦学习（Federated Learning）是一种新兴的人工智能基础技术，在 2016 年由谷歌最先提出，原本用于解决安卓手机终端用户在本地更新模型的问题，其设计目标是在保障大数据交换时的信息安全、保护终端数据和个人数据隐私、保证合法合规的前提下，在多参与方或多计算结点之间开展高效率的机器学习。其中，联邦学习可使用的机器学习算法不局限于神经网络，还包括随机森林等重要算法。联邦学习有望成为下一代人工智能协同算法和协作网络的基础。

【腾讯等】可信赖图学习：可靠性、可解释性和隐私保护，A Survey of Trustworthy Graph Learning: Reliability, Explainability, and Privacy Protection

专知会员服务

20+阅读 · 2022年5月24日

【开放书】隐私的现代社会技术视角，459页pdf，Modern Socio-Technical Perspectives on Privacy

专知会员服务

21+阅读 · 2022年3月24日

联邦学习智慧医疗综述

专知会员服务

122+阅读 · 2021年11月27日

联邦学习隐私保护研究进展

专知会员服务

94+阅读 · 2021年7月23日