UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models

The rapid advancement of diffusion models (DMs) has not only transformed various real-world industries but has also introduced negative societal concerns, including the generation of harmful content, copyright disputes, and the rise of stereotypes and biases. To mitigate these issues, machine unlearning (MU) has emerged as a potential solution, demonstrating its ability to remove undesired generative capabilities of DMs in various applications. However, by examining existing MU evaluation methods, we uncover several key challenges that can result in incomplete, inaccurate, or biased evaluations for MU in DMs. To address them, we enhance the evaluation metrics for MU, including the introduction of an often-overlooked retainability measurement for DMs post-unlearning. Additionally, we introduce UnlearnCanvas, a comprehensive high-resolution stylized image dataset that facilitates us to evaluate the unlearning of artistic painting styles in conjunction with associated image objects. We show that this dataset plays a pivotal role in establishing a standardized and automated evaluation framework for MU techniques on DMs, featuring 7 quantitative metrics to address various aspects of unlearning effectiveness. Through extensive experiments, we benchmark 5 state-of-the-art MU methods, revealing novel insights into their pros and cons, and the underlying unlearning mechanisms. Furthermore, we demonstrate the potential of UnlearnCanvas to benchmark other generative modeling tasks, such as style transfer. The UnlearnCanvas dataset, benchmark, and the codes to reproduce all the results in this work can be found at https://github.com/OPTML-Group/UnlearnCanvas.

翻译：扩散模型（DMs）的快速发展不仅改变了多个现实行业，也引发了有害内容生成、版权争议以及刻板印象与偏见加剧等负面社会问题。为缓解这些问题，机器遗忘（MU）作为一种潜在解决方案应运而生，并在多项应用中展现出移除扩散模型不良生成能力的作用。然而，通过审视现有MU评估方法，我们发现在扩散模型MU评估中存在若干关键挑战，可能导致评估结果不完整、不准确或存在偏差。为此，我们改进了MU的评估指标，包括引入扩散模型遗忘后常被忽略的保持能力度量方法。此外，我们提出了UnlearnCanvas——一个高分辨率风格化图像综合数据集，可用于评估与图像对象关联的艺术绘画风格的遗忘效果。研究表明，该数据集在建立标准化、自动化的扩散模型MU技术评估框架中起到关键作用，该框架包含7项定量指标，从多个维度衡量遗忘有效性。通过大量实验，我们对5种最先进的MU方法进行基准测试，揭示了其优缺点及底层遗忘机制的新见解。进一步地，我们展示了UnlearnCanvas在风格迁移等其他生成建模任务基准测试中的潜力。UnlearnCanvas数据集、基准测试以及本工作所有结果的可复现代码均可在https://github.com/OPTML-Group/UnlearnCanvas获取。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

专知会员服务

15+阅读 · 2022年3月12日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日