Recommendation Unlearning via Influence Function

Recommendation unlearning is an emerging task to serve users for erasing unusable data (e.g., some historical behaviors) from a well-trained recommender model. Existing methods process unlearning requests by fully or partially retraining the model after removing the unusable data. However, these methods are impractical due to the high computation cost of full retraining and the highly possible performance damage of partial training. In this light, a desired recommendation unlearning method should obtain a similar model as full retraining in a more efficient manner, i.e., achieving complete, efficient and innocuous unlearning. In this work, we propose an Influence Function-based Recommendation Unlearning (IFRU) framework, which efficiently updates the model without retraining by estimating the influence of the unusable data on the model via the influence function. In the light that recent recommender models use historical data for both the constructions of the optimization loss and the computational graph (e.g., neighborhood aggregation), IFRU jointly estimates the direct influence of unusable data on optimization loss and the spillover influence on the computational graph to pursue complete unlearning. Furthermore, we propose an importance-based pruning algorithm to reduce the cost of the influence function. IFRU is innocuous and applicable to mainstream differentiable models. Extensive experiments demonstrate that IFRU achieves more than250times acceleration compared to retraining-based methods with recommendation performance comparable to full retraining.

翻译：推荐遗忘是一项新兴任务，旨在从已训练好的推荐模型中擦除不可用数据（如部分历史行为）。现有方法通过移除不可用数据后对模型进行完全或部分重训练来处理遗忘请求。然而，这些方法因完全重训练的高计算成本和部分训练可能导致的性能严重下降而缺乏实用性。为此，理想的推荐遗忘方法应以更高效的方式获得与完全重训练相似的模型，即实现完整、高效且无害的遗忘。本文提出基于影响函数的推荐遗忘框架（IFRU），通过影响函数估计不可用数据对模型的影响，无需重训练即可高效更新模型。鉴于当前推荐模型同时利用历史数据构建优化损失和计算图（如邻居聚合），IFRU联合估计不可用数据对优化损失的直接影响和对计算图的溢出影响，以实现完整遗忘。此外，我们提出基于重要性的剪枝算法以降低影响函数的计算成本。IFRU具有无害性，适用于主流可微分模型。大量实验表明，相较于基于重训练的方法，IFRU可实现超过250倍的加速，且推荐性能与完全重训练相当。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/