Large language models (LLMs) have been shown to memorize and reproduce content from their training data, raising significant privacy concerns, especially for models trained on web-scale datasets. Existing methods for detecting memorization are largely sample-specific: they rely on manually crafted or discretely optimized memory-inducing prompts generated per sample, which makes dataset-level detection impractical due to the prohibitive cost of iterating over every sample. In real-world scenarios, data owners may need to verify whether a suspect LLM has memorized their dataset, particularly when the data may have been collected from the web without authorization. To address this, we introduce \textit{MemHunter}, which trains a memory-inducing LLM and employs hypothesis testing to efficiently detect memorization at the dataset level, without requiring sample-specific prompt optimization. Experiments on models such as Pythia and Llama-2 demonstrate that \textit{MemHunter} extracts up to 40\% more training data than existing methods under constrained time budgets and reduces search time by up to 80\% when integrated as a plug-in. Crucially, \textit{MemHunter} is the first method capable of dataset-level memorization detection, providing an indispensable tool for assessing the privacy risks of LLMs trained on vast web-sourced datasets.
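The dataset-level decision via hypothesis testing can be illustrated with a minimal sketch. This is not MemHunter's actual procedure: the test statistic, baseline rate, and function names below are illustrative assumptions. The idea is that under the null hypothesis ("the model has not memorized the dataset"), verbatim extractions succeed only at some small baseline rate; observing significantly more successes across the dataset rejects the null.

```python
# Hedged sketch: a one-sided binomial test over per-sample extraction
# outcomes. All names, the baseline rate, and the significance level are
# illustrative assumptions, not MemHunter's published implementation.
from math import comb


def binomial_p_value(successes: int, trials: int, p0: float) -> float:
    """One-sided p-value: P(X >= successes) for X ~ Binomial(trials, p0)."""
    return sum(
        comb(trials, k) * p0**k * (1 - p0) ** (trials - k)
        for k in range(successes, trials + 1)
    )


def dataset_memorized(extraction_flags, baseline_rate=0.01, alpha=0.05):
    """Reject H0 ("not memorized") when the observed verbatim-extraction
    rate across the dataset significantly exceeds the baseline rate
    expected for non-member data."""
    n = len(extraction_flags)          # number of probed samples
    k = sum(extraction_flags)          # number of verbatim extractions
    return binomial_p_value(k, n, baseline_rate) < alpha


# Example: 12 of 100 probed samples yield verbatim continuations.
print(dataset_memorized([1] * 12 + [0] * 88))  # True
```

Because the decision aggregates outcomes over the whole dataset, no per-sample prompt search is needed at test time, which is what makes the dataset-level setting tractable.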