Hiding in Plain Sight: Disguising Data Stealing Attacks in Federated Learning

Malicious server (MS) attacks have enabled the scaling of data stealing in federated learning to large batch sizes and secure aggregation, settings previously considered private. However, many concerns regarding client-side detectability of MS attacks were raised, questioning their practicality once they are publicly known. In this work, for the first time, we thoroughly study the problem of client-side detectability.We demonstrate that most prior MS attacks, which fundamentally rely on one of two key principles, are detectable by principled client-side checks. Further, we formulate desiderata for practical MS attacks and propose SEER, a novel attack framework that satisfies all desiderata, while stealing user data from gradients of realistic networks, even for large batch sizes (up to 512 in our experiments) and under secure aggregation. The key insight of SEER is the use of a secret decoder, which is jointly trained with the shared model. Our work represents a promising first step towards more principled treatment of MS attacks, paving the way for realistic data stealing that can compromise user privacy in real-world deployments.

翻译：恶意服务器攻击已使得联邦学习中的数据窃取得以扩展至大批次规模和安全聚合场景，而这些设置此前被认为具有隐私保护性。然而，针对恶意服务器攻击在客户端侧的可检测性引发了诸多担忧，质疑其一旦公开后的实际可行性。本研究首次系统性地探讨了客户端侧的可检测性问题。我们证明，绝大多数现有恶意服务器攻击（本质上依赖于两种核心原理之一）均可通过原则性的客户端检测手段被识别。进一步地，我们制定了实用化恶意服务器攻击的设计准则，并提出SEER这一新型攻击框架，该框架在满足所有设计准则的同时，能够从现实网络的梯度中窃取用户数据——即便在大批次（实验中最高达512）和安全聚合条件下依然有效。SEER的核心创新在于使用一个与共享模型联合训练的隐蔽解码器。本工作为恶意服务器攻击的更原则化处理迈出了富有前景的第一步，为实际部署中可能损害用户隐私的现实数据窃取铺平道路。

相关内容

关注 0

多媒体系统（MS）期刊详细介绍了多媒体计算，通信，存储和应用的各个方面的创新研究思想，新兴技术，最新方法和工具。它包含理论，实验和调查文章。多媒体系统的覆盖范围包括：在计算机系统中集成数字视频和音频功能；多媒体信息编码和数据交换格式；数字多媒体的操作系统机制；数字视频和音频网络与通信；存储模型和结构；用于支持多媒体应用程序的方法、范式、工具和软件体系结构；多媒体应用程序和应用程序接口，以及多媒体终端系统架构。官网地址：http://dblp.uni-trier.de/db/journals/mms/

加速图神经网络推理，121页ppt，普林斯顿大学JAVIER DUARTE主讲

专知会员服务

33+阅读 · 2022年6月13日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日