UnlearnCanvas: A Stylized Image Dataset to Benchmark Machine Unlearning for Diffusion Models

The rapid advancement of diffusion models (DMs) has not only transformed various real-world industries but has also introduced negative societal concerns, including the generation of harmful content, copyright disputes, and the rise of stereotypes and biases. To mitigate these issues, machine unlearning (MU) has emerged as a potential solution, demonstrating its ability to remove undesired generative capabilities of DMs in various applications. However, by examining existing MU evaluation methods, we uncover several key challenges that can result in incomplete, inaccurate, or biased evaluations for MU in DMs. To address them, we enhance the evaluation metrics for MU, including the introduction of an often-overlooked retainability measurement for DMs post-unlearning. Additionally, we introduce UnlearnCanvas, a comprehensive high-resolution stylized image dataset that facilitates us to evaluate the unlearning of artistic painting styles in conjunction with associated image objects. We show that this dataset plays a pivotal role in establishing a standardized and automated evaluation framework for MU techniques on DMs, featuring 7 quantitative metrics to address various aspects of unlearning effectiveness. Through extensive experiments, we benchmark 5 state-of-the-art MU methods, revealing novel insights into their pros and cons, and the underlying unlearning mechanisms. Furthermore, we demonstrate the potential of UnlearnCanvas to benchmark other generative modeling tasks, such as style transfer. The UnlearnCanvas dataset, benchmark, and the codes to reproduce all the results in this work can be found at https://github.com/OPTML-Group/UnlearnCanvas.

翻译：扩散模型（DMs）的飞速发展不仅变革了多个实际行业，也引发了有害内容生成、版权争议以及刻板印象与偏见加剧等社会负面问题。为缓解这些问题，机器遗忘（MU）作为一种潜在解决方案应运而生，其在移除DM不良生成能力方面展现了显著效果。然而，通过剖析现有MU评估方法，我们发现了若干关键挑战，这些挑战可能导致DM中MU评估的片面性、不准确性或偏差。为此，我们改进了MU评估指标，包括引入DM遗忘后常被忽视的保留能力度量。此外，我们提出了UnlearnCanvas——一个高分辨率风格化图像综合数据集，用于评估与关联图像对象相结合的艺术绘画风格遗忘效果。研究表明，该数据集在构建DM上MU技术的标准化、自动化评估框架中发挥关键作用，包含7项量化指标以全面衡量遗忘效果。通过大量实验，我们对5种前沿MU方法进行了基准测试，揭示了其优势、不足及底层遗忘机制。进一步地，我们展示了UnlearnCanvas在风格迁移等其他生成建模任务基准测试中的潜力。本工作中使用的UnlearnCanvas数据集、基准测试及复现所有结果的代码已开源至https://github.com/OPTML-Group/UnlearnCanvas。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

【CVPR 2022】一种无需使用负样本的自监督学习方法，Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes

专知会员服务

15+阅读 · 2022年3月12日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日