Who Leaked the Model? Tracking IP Infringers in Accountable Federated Learning

Federated learning (FL) emerges as an effective collaborative learning framework to coordinate data and computation resources from massive and distributed clients in training. Such collaboration results in non-trivial intellectual property (IP) represented by the model parameters that should be protected and shared by the whole party rather than an individual user. Meanwhile, the distributed nature of FL endorses a malicious client the convenience to compromise IP through illegal model leakage to unauthorized third parties. To block such IP leakage, it is essential to make the IP identifiable in the shared model and locate the anonymous infringer who first leaks it. The collective challenges call for \emph{accountable federated learning}, which requires verifiable ownership of the model and is capable of revealing the infringer's identity upon leakage. In this paper, we propose Decodable Unique Watermarking (DUW) for complying with the requirements of accountable FL. Specifically, before a global model is sent to a client in an FL round, DUW encodes a client-unique key into the model by leveraging a backdoor-based watermark injection. To identify the infringer of a leaked model, DUW examines the model and checks if the triggers can be decoded as the corresponding keys. Extensive empirical results show that DUW is highly effective and robust, achieving over $99\%$ watermark success rate for Digits, CIFAR-10, and CIFAR-100 datasets under heterogeneous FL settings, and identifying the IP infringer with $100\%$ accuracy even after common watermark removal attempts.

翻译：联邦学习（FL）作为一种有效的协作学习框架，能够协调海量分布式客户端的计算与数据资源进行模型训练。这种协作产生的模型参数构成了需要由全体参与方而非单个用户共享和保护的重要知识产权（IP）。然而，FL的分布式特性为恶意客户端提供了便利，使其能够通过向未授权的第三方非法泄露模型来侵害知识产权。为阻止此类IP泄露，必须确保共享模型中的IP可识别，并定位首先泄露模型的匿名侵权者。这些挑战共同催生了"可问责联邦学习"——要求模型具备可验证的所有权，并能在泄露事件发生后揭露侵权者身份。本文提出可解码唯一水印（DUW）以满足可问责FL的需求。具体而言，在每轮联邦学习中向客户端发送全局模型前，DUW通过后门水印注入机制将客户端唯一密钥编码至模型中。为识别泄露模型的侵权者，DUW检测模型并验证触发器是否能解码为对应密钥。大量实验结果表明，DUW具有高效性和鲁棒性，在异构FL设置下对Digits、CIFAR-10和CIFAR-100数据集的水印成功率超过99%，即便经过常见的水印移除攻击后，仍能以100%的准确率识别知识产权侵权者。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日