A Duty to Forget, a Right to be Assured? Exposing Vulnerabilities in Machine Unlearning Services

The right to be forgotten requires the removal or "unlearning" of a user's data from machine learning models. However, in the context of Machine Learning as a Service (MLaaS), retraining a model from scratch to fulfill an unlearning request is impractical because the service provider (the server) lacks the training data. Moreover, approximate unlearning introduces a complex trade-off between utility (model performance) and privacy (unlearning performance). In this paper, we explore the potential threats posed by unlearning services in MLaaS, specifically over-unlearning, where more information is unlearned than expected. We propose two strategies that leverage over-unlearning to measure its impact on this trade-off under black-box access settings, in which existing machine unlearning attacks are not applicable. We evaluate the effectiveness of these strategies through extensive experiments on benchmark datasets, across various model architectures and representative unlearning approaches. The results indicate that both strategies have significant potential to undermine model efficacy in unlearning scenarios. This study uncovers an underexplored gap between unlearning and contemporary MLaaS, highlighting the need for careful consideration in balancing data unlearning, model utility, and security.
