Modern computer systems store vast amounts of personal data, enabling advances in AI and ML but putting user privacy and trust at risk. For privacy reasons, an ML model may need to forget part of the data it was trained on. In this paper, we introduce a novel unlearning approach based on Forgetting Neural Networks (FNNs), a neuroscience-inspired architecture that explicitly encodes forgetting through multiplicative decay factors. While FNNs have previously been studied only as a theoretical construct, we provide the first concrete implementation and demonstrate their effectiveness for targeted unlearning. We propose several variants with per-neuron forgetting factors, including rank-based assignments guided by activation levels, and evaluate them on the MNIST and Fashion-MNIST benchmarks. Our method systematically removes information associated with forget sets while preserving performance on retained data. Membership inference attacks confirm that FNN-based unlearning effectively erases information about the forgotten training data from the network. These results establish FNNs as a promising foundation for efficient and interpretable unlearning.
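The forgetting mechanism described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the layer shape, the `rank_based_factors` helper, the decay range, and the forget-set activation values are all hypothetical, chosen only to show how per-neuron multiplicative decay with rank-based factor assignment might look.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical hidden layer: weight matrix W (in_dim x out_dim),
# one column of incoming weights per hidden neuron.
W = rng.normal(size=(8, 4))

# Hypothetical mean activation of each hidden neuron on the forget set.
forget_activations = np.array([0.9, 0.1, 0.5, 0.7])

def rank_based_factors(activations, lam_min=0.5, lam_max=0.99):
    """Assign a forgetting factor to each neuron by activation rank:
    neurons most active on the forget set receive the strongest decay
    (smallest factor). Range [lam_min, lam_max] is an assumption."""
    order = np.argsort(-activations)              # most active first
    levels = np.linspace(lam_min, lam_max, len(activations))
    factors = np.empty_like(levels)
    factors[order] = levels                       # most active -> lam_min
    return factors

def decay_step(W, lam):
    """One unlearning step: multiplicatively decay each neuron's
    incoming weights by its per-neuron forgetting factor."""
    return W * lam[np.newaxis, :]

lam = rank_based_factors(forget_activations)
W_unlearned = decay_step(W, lam)
```

Repeating `decay_step` drives the weights of neurons most implicated in the forget set toward zero fastest, while neurons with factors near 1 are largely preserved, which is the intuition behind retaining performance on the remaining data.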