A Duty to Forget, a Right to be Assured? Exposing Vulnerabilities in Machine Unlearning Services

The right to be forgotten requires the removal or "unlearning" of a user's data from machine learning models. However, in the context of Machine Learning as a Service (MLaaS), retraining a model from scratch to fulfill the unlearning request is impractical due to the lack of training data on the service provider's side (the server). Furthermore, approximate unlearning further embraces a complex trade-off between utility (model performance) and privacy (unlearning performance). In this paper, we try to explore the potential threats posed by unlearning services in MLaaS, specifically over-unlearning, where more information is unlearned than expected. We propose two strategies that leverage over-unlearning to measure the impact on the trade-off balancing, under black-box access settings, in which the existing machine unlearning attacks are not applicable. The effectiveness of these strategies is evaluated through extensive experiments on benchmark datasets, across various model architectures and representative unlearning approaches. Results indicate significant potential for both strategies to undermine model efficacy in unlearning scenarios. This study uncovers an underexplored gap between unlearning and contemporary MLaaS, highlighting the need for careful considerations in balancing data unlearning, model utility, and security.

翻译：被遗忘权要求从机器学习模型中移除或“遗忘”用户数据。然而，在机器学习即服务（MLaaS）的背景下，由于服务提供商（服务器）侧缺乏训练数据，从头开始重新训练模型以完成遗忘请求是不切实际的。此外，近似遗忘进一步涉及效用（模型性能）与隐私（遗忘性能）之间的复杂权衡。本文试图探索MLaaS中遗忘服务带来的潜在威胁，特别是过度遗忘，即被遗忘的信息超出预期。我们提出了两种利用过度遗忘来衡量其对权衡平衡影响的策略，这些策略适用于黑盒访问设置，而现有的机器遗忘攻击在此设置下无法适用。通过在基准数据集上进行广泛实验，评估了这些策略在不同模型架构和代表性遗忘方法下的有效性。结果表明，这两种策略在遗忘场景中破坏模型效能的潜力显著。本研究揭示了遗忘与当代MLaaS之间尚未被充分探索的鸿沟，强调了在平衡数据遗忘、模型效用和安全性方面需要进行审慎考量。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日