Secure and Fast Asynchronous Vertical Federated Learning via Cascaded Hybrid Optimization

Vertical Federated Learning (VFL) is attracting increasing attention because it enables multiple parties to jointly train a privacy-preserving model over vertically partitioned data. Recent research has shown that zeroth-order optimization (ZOO) offers many advantages for building a practical VFL algorithm. However, a key problem with ZOO-based VFL is its slow convergence rate, which limits its use with modern large models. To address this problem, we propose a cascaded hybrid optimization method for VFL. In this method, the downstream models (clients) are trained with ZOO to protect privacy and to ensure that no internal information is shared. Meanwhile, the upstream model (server) is updated locally with first-order optimization (FOO), which significantly improves the convergence rate and makes it feasible to train large models without compromising privacy or security. We theoretically prove that our VFL framework converges faster than ZOO-based VFL: because its convergence is not limited by the size of the server model, it is effective for training large models whose major part resides on the server. Extensive experiments demonstrate that our method converges faster than the ZOO-based VFL framework while maintaining an equivalent level of privacy protection. Moreover, we show that the convergence of our VFL framework is comparable to that of the unsafe FOO-based VFL baseline. Finally, we demonstrate that our method makes training a large model feasible.
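As a rough illustration of the cascaded idea described in the abstract, the sketch below trains a toy client model with a two-point zeroth-order estimate while the server model uses its exact gradient. It is not the authors' algorithm: the abstract does not give the update rules, so the single synchronous client, the tanh embedding, the linear server head, the Gaussian perturbation, and every name here (`embed`, `server_loss`, `server_grad`, `train_cascaded`) are assumptions made for the example.

```python
import math
import random

def embed(w, X):
    """Hypothetical client model: one scalar embedding per sample, tanh(w . x)."""
    return [math.tanh(sum(wj * xj for wj, xj in zip(w, x))) for x in X]

def server_loss(theta, h, y):
    """Hypothetical server model: prediction a*h + b with halved mean squared error."""
    a, b = theta
    return 0.5 * sum((a * hi + b - yi) ** 2 for hi, yi in zip(h, y)) / len(y)

def server_grad(theta, h, y):
    """Exact first-order gradient of server_loss with respect to theta = (a, b)."""
    a, b = theta
    res = [a * hi + b - yi for hi, yi in zip(h, y)]
    n = len(y)
    return [sum(r * hi for r, hi in zip(res, h)) / n, sum(res) / n]

def train_cascaded(X, y, steps=400, mu=1e-3, lr_client=0.02, lr_server=0.1, seed=0):
    """Toy cascaded loop: ZOO on the client parameters, FOO on the server parameters."""
    rng = random.Random(seed)
    w = [rng.gauss(0.0, 0.1) for _ in X[0]]  # client parameters
    theta = [0.1, 0.0]                       # server parameters (a, b)
    losses = []
    for _ in range(steps):
        # Client: compute embeddings for the current and a randomly perturbed w.
        u = [rng.gauss(0.0, 1.0) for _ in w]
        h = embed(w, X)
        h_pert = embed([wj + mu * uj for wj, uj in zip(w, u)], X)
        # Server: evaluate both embeddings; only scalar losses go back to the client.
        loss = server_loss(theta, h, y)
        loss_pert = server_loss(theta, h_pert, y)
        grad_theta = server_grad(theta, h, y)
        # Client: two-point zeroth-order gradient estimate, no backpropagated gradient.
        scale = (loss_pert - loss) / mu
        w = [wj - lr_client * scale * uj for wj, uj in zip(w, u)]
        # Server: ordinary first-order step on its own parameters.
        theta = [t - lr_server * g for t, g in zip(theta, grad_theta)]
        losses.append(loss)
    return w, theta, losses
```

In this sketch the client never receives a gradient from the server, only two scalar loss values, which is the property the abstract attributes to ZOO on the client side. The server step uses an exact gradient, so its cost does not grow with a random-direction estimate over the server's parameters.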
