Hiding in Plain Sight: Disguising Data Stealing Attacks in Federated Learning

Malicious server (MS) attacks have enabled the scaling of data stealing in federated learning to large batch sizes and secure aggregation, settings previously considered private. However, many concerns regarding client-side detectability of MS attacks were raised, questioning their practicality once they are publicly known. In this work, for the first time, we thoroughly study the problem of client-side detectability.We demonstrate that most prior MS attacks, which fundamentally rely on one of two key principles, are detectable by principled client-side checks. Further, we formulate desiderata for practical MS attacks and propose SEER, a novel attack framework that satisfies all desiderata, while stealing user data from gradients of realistic networks, even for large batch sizes (up to 512 in our experiments) and under secure aggregation. The key insight of SEER is the use of a secret decoder, which is jointly trained with the shared model. Our work represents a promising first step towards more principled treatment of MS attacks, paving the way for realistic data stealing that can compromise user privacy in real-world deployments.

翻译：恶意服务器攻击已使联邦学习中的数据窃取能够扩展至大批量训练与安全聚合场景——这些设置先前被视为具有隐私保障。然而，关于恶意服务器攻击在客户端侧可检测性的诸多担忧随之而来，质疑其一旦被公开后是否仍具实用性。本研究首次系统性地探讨了客户端侧可检测性问题。我们证明，绝大多数基于两大核心原理的现有恶意服务器攻击，均可通过原则性的客户端侧检查被检测到。进一步，我们提出了实用化恶意服务器攻击的若干设计准则，并构建了SEER这一新型攻击框架。该框架在满足所有设计准则的同时，能够从现实网络模型的梯度中窃取用户数据——即便在大批量训练（实验中最高达512）及安全聚合条件下依然有效。SEER的核心创新在于采用与共享模型联合训练的隐秘解码器。本研究为更规范地处理恶意服务器攻击迈出了关键第一步，为可在真实部署中侵害用户隐私的现实数据窃取技术奠定了基础。

相关内容

关注 0

多媒体系统（MS）期刊详细介绍了多媒体计算，通信，存储和应用的各个方面的创新研究思想，新兴技术，最新方法和工具。它包含理论，实验和调查文章。多媒体系统的覆盖范围包括：在计算机系统中集成数字视频和音频功能；多媒体信息编码和数据交换格式；数字多媒体的操作系统机制；数字视频和音频网络与通信；存储模型和结构；用于支持多媒体应用程序的方法、范式、工具和软件体系结构；多媒体应用程序和应用程序接口，以及多媒体终端系统架构。官网地址：http://dblp.uni-trier.de/db/journals/mms/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日