Machine unlearning (MU) aims to make a well-trained model forget specific data, a practically important task due to the ``right to be forgotten''. The unlearned model should approximate the retrained model, in which the forgetting data never participate in training and hence contribute nothing. Given the forgetting data's absence during retraining, we argue that unlearning should withdraw their contribution from the pre-trained model. The challenge is how to quantify and detach a sample's contribution to the dynamic learning process using only the pre-trained model, when tracing the learning process is impractical. We first show theoretically that a sample's contribution during learning is reflected in the learned model's sensitivity to it. We then design a novel practical method, MU-Mis (Machine Unlearning by Minimizing input sensitivity), to suppress the contribution of the forgetting data. Experimental results demonstrate that MU-Mis unlearns effectively and efficiently without utilizing the remaining data. To our knowledge, this is the first time a remaining-data-free method outperforms state-of-the-art (SoTA) unlearning methods that utilize the remaining data.
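The core idea — suppressing a forgetting sample's contribution by minimizing the model's sensitivity to its input — can be sketched in miniature. The toy network, the finite-difference optimizer, and all hyperparameters below are illustrative assumptions, not the paper's actual setup; the sketch only shows the shape of the objective: gradient descent on the squared input-gradient norm of the forgetting sample, with no remaining data involved.

```python
import numpy as np

# Illustrative sketch (not the paper's implementation): unlearn a sample by
# minimizing the model's input sensitivity ||d f / d x||^2 on that sample.
rng = np.random.default_rng(0)

# Tiny one-hidden-layer network: f(x) = w2 . tanh(W1 @ x)
W1 = 0.5 * rng.normal(size=(8, 4))
w2 = 0.5 * rng.normal(size=8)

def input_grad(x, W1, w2):
    # d f / d x = W1^T (w2 * sech^2(W1 @ x)), by the chain rule
    h = W1 @ x
    return W1.T @ (w2 / np.cosh(h) ** 2)

def sensitivity(x, W1, w2):
    g = input_grad(x, W1, w2)
    return float(g @ g)  # squared input-gradient norm

x_forget = rng.normal(size=4)  # the sample to be forgotten

def step(W1, w2, x, lr=0.01, eps=1e-5):
    # One finite-difference gradient-descent step on the sensitivity loss,
    # taken with respect to the weights (kept numerical for brevity).
    base = sensitivity(x, W1, w2)
    gW1 = np.zeros_like(W1)
    for i in np.ndindex(W1.shape):
        Wp = W1.copy(); Wp[i] += eps
        gW1[i] = (sensitivity(x, Wp, w2) - base) / eps
    gw2 = np.zeros_like(w2)
    for i in range(w2.size):
        wp = w2.copy(); wp[i] += eps
        gw2[i] = (sensitivity(x, W1, wp) - base) / eps
    return W1 - lr * gW1, w2 - lr * gw2

before = sensitivity(x_forget, W1, w2)
for _ in range(100):
    W1, w2 = step(W1, w2, x_forget)
after = sensitivity(x_forget, W1, w2)
print(before, after)
```

In practice the sensitivity objective would be optimized with automatic differentiation over a deep network and regularized to preserve utility; this sketch only demonstrates that driving down input sensitivity on the forgetting data needs no access to the remaining data.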