TMI! Finetuned Models Leak Private Information from their Pretraining Data - 专知论文

会员服务 ·

0

TMI · MoDELS · INFORMS · Learning · 推断 ·

2023 年 6 月 1 日

TMI! Finetuned Models Leak Private Information from their Pretraining Data

翻译：TMI！微调模型泄露其预训练数据中的隐私信息

John Abascal,Stanley Wu,Alina Oprea,Jonathan Ullman

Transfer learning has become an increasingly popular technique in machine learning as a way to leverage a pretrained model trained for one task to assist with building a finetuned model for a related task. This paradigm has been especially popular for privacy in machine learning, where the pretrained model is considered public, and only the data for finetuning is considered sensitive. However, there are reasons to believe that the data used for pretraining is still sensitive, making it essential to understand how much information the finetuned model leaks about the pretraining data. In this work we propose a new membership-inference threat model where the adversary only has access to the finetuned model and would like to infer the membership of the pretraining data. To realize this threat model, we implement a novel metaclassifier-based attack, TMI, that leverages the influence of memorized pretraining samples on predictions in the downstream task. We evaluate TMI on both vision and natural language tasks across multiple transfer learning settings, including finetuning with differential privacy. Through our evaluation, we find that TMI can successfully infer membership of pretraining examples using query access to the finetuned model.

翻译：迁移学习已成为机器学习中日益流行的技术，其利用为一项任务预训练的模型来辅助构建相关任务的微调模型。这一范式在机器学习隐私领域尤为常见，其中预训练模型被视为公开，而仅微调数据被视为敏感。然而，有理由相信预训练数据仍具敏感性，因此理解微调模型泄露预训练数据信息的程度至关重要。本文提出一种新的成员推断威胁模型，其中攻击者仅能访问微调模型，并试图推断预训练数据的成员身份。为实现该威胁模型，我们设计了一种基于元分类器的新型攻击方法TMI，该方法利用记忆化预训练样本在下游任务预测中的影响力。我们在包括差分隐私微调在内的多种迁移学习设置下，对视觉和自然语言任务评估了TMI。评估结果表明，TMI能够通过查询微调模型成功推断预训练样本的成员身份。

0

相关内容

TMI

TMI：IEEE Transactions on Medical Imaging。 Explanation：医学影像学的IEEE交易。 Publisher：IEEE。 SIT： http://dblp.uni-trier.de/db/journals/tmi/

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

128+阅读 · 2022年4月21日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

46+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Purdue电子与计算机工程系李海桐NanoX实验室招收AI硬件全奖博士生（2023秋季）

Purdue电子与计算机工程系李海桐NanoX实验室招收AI硬件全奖博士生（2023秋季）

机器之心

0+阅读 · 2022年10月15日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

专知

37+阅读 · 2018年2月21日

广义欧拉多项式的实根性

国家自然科学基金

0+阅读 · 2015年12月31日

砂土液化大变形的热力学机理与本构模型研究

国家自然科学基金

0+阅读 · 2014年12月31日

“LXR-lncRNA HULC-FOXM1通路”的鉴定及其在抗肝细胞癌中的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

有机空穴传输材料/铋基氧化物杂化光催化剂的性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

向量优化问题的近似解的最优性条件

国家自然科学基金

0+阅读 · 2012年12月31日

基于集合卡尔曼滤波的污染源反演方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

miR-663在肝癌细胞增殖中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

1-3型正交异性水泥基压电复合材料的性能及其机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

硅基锗锡合金材料外延生长机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

TRAIL在动脉粥样硬化发生发展中作用机制的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Leveraging Label Variation in Large Language Models for Zero-Shot Text Classification

Arxiv

0+阅读 · 2023年7月24日

Addressing the Impact of Localized Training Data in Graph Neural Networks

Arxiv

0+阅读 · 2023年7月24日

Client-Level Differential Privacy via Adaptive Intermediary in Federated Medical Imaging

Arxiv

0+阅读 · 2023年7月24日

Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering

Arxiv

0+阅读 · 2023年7月21日

Epsilon*: Privacy Metric for Machine Learning Models

Arxiv

0+阅读 · 2023年7月21日

My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks

Arxiv

0+阅读 · 2023年7月20日

Unsupervised 3D registration through optimization-guided cyclical self-training

Arxiv

0+阅读 · 2023年7月20日

Pretrained Transformers for Text Ranking: BERT and Beyond

Arxiv

28+阅读 · 2020年10月13日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

Low-Shot Learning from Imaginary Data

Arxiv

15+阅读 · 2018年4月3日

VIP会员

文章信息

相关主题

最新内容

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

专知会员服务

5+阅读 · 今天8:00

重新思考无人机时代的生存能力

重新思考无人机时代的生存能力

专知会员服务

4+阅读 · 今天7:44

装甲突击旅：现代战争思考、战斗与组织

装甲突击旅：现代战争思考、战斗与组织

专知会员服务

4+阅读 · 今天7:28

在人工智能加速决策环境中拓展OODA循环

在人工智能加速决策环境中拓展OODA循环

专知会员服务

4+阅读 · 今天7:18

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

《廉价自杀式无人机战争的军事战略影响：乌克兰与伊朗案例研究》

专知会员服务

5+阅读 · 今天7:07

军事欺骗：供作战战术指挥官使用的工具

军事欺骗：供作战战术指挥官使用的工具

专知会员服务

4+阅读 · 今天7:03

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

专知会员服务

4+阅读 · 6月23日

综述 | 世界动作模型：少做梦，多行动

综述 | 世界动作模型：少做梦，多行动

专知会员服务

5+阅读 · 6月23日

美以伊冲突：无人机与人工智能的运用

美以伊冲突：无人机与人工智能的运用

专知会员服务

10+阅读 · 6月23日

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

专知会员服务

4+阅读 · 6月23日

《特种部队在透明战场中的生存力》最新报告

《特种部队在透明战场中的生存力》最新报告

专知会员服务

5+阅读 · 6月23日

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

专知会员服务

8+阅读 · 6月23日

《人工智能生成的零日漏洞：对未来作战的影响》

《人工智能生成的零日漏洞：对未来作战的影响》

专知会员服务

7+阅读 · 6月23日

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

专知会员服务

4+阅读 · 6月23日

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

6+阅读 · 6月22日

相关VIP内容

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

128+阅读 · 2022年4月21日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

46+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

重新思考无人机时代的生存能力

在人工智能加速决策环境中拓展OODA循环

反无人机拦截器训练与运用课程：对美国陆军部队发展的启示

装甲突击旅：现代战争思考、战斗与组织

相关资讯

Purdue电子与计算机工程系李海桐NanoX实验室招收AI硬件全奖博士生（2023秋季）

Purdue电子与计算机工程系李海桐NanoX实验室招收AI硬件全奖博士生（2023秋季）

机器之心

0+阅读 · 2022年10月15日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

TorchSeg：基于pytorch的语义分割算法开源了

TorchSeg：基于pytorch的语义分割算法开源了

极市平台

20+阅读 · 2019年1月28日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

pytorch-pretrained-BERT：BERT PyTorch实现，可加载Google BERT预训练模型

AINLP

35+阅读 · 2018年11月6日

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

专知

37+阅读 · 2018年2月21日

相关论文

Leveraging Label Variation in Large Language Models for Zero-Shot Text Classification

Arxiv

0+阅读 · 2023年7月24日

Addressing the Impact of Localized Training Data in Graph Neural Networks

Arxiv

0+阅读 · 2023年7月24日

Client-Level Differential Privacy via Adaptive Intermediary in Federated Medical Imaging

Arxiv

0+阅读 · 2023年7月24日

Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering

Arxiv

0+阅读 · 2023年7月21日

Epsilon*: Privacy Metric for Machine Learning Models

Arxiv

0+阅读 · 2023年7月21日

My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks

Arxiv

0+阅读 · 2023年7月20日

Unsupervised 3D registration through optimization-guided cyclical self-training

Arxiv

0+阅读 · 2023年7月20日

Pretrained Transformers for Text Ranking: BERT and Beyond

Arxiv

28+阅读 · 2020年10月13日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

Low-Shot Learning from Imaginary Data

Arxiv

15+阅读 · 2018年4月3日

相关基金

广义欧拉多项式的实根性

国家自然科学基金

0+阅读 · 2015年12月31日

砂土液化大变形的热力学机理与本构模型研究

国家自然科学基金

0+阅读 · 2014年12月31日

“LXR-lncRNA HULC-FOXM1通路”的鉴定及其在抗肝细胞癌中的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

有机空穴传输材料/铋基氧化物杂化光催化剂的性能研究

国家自然科学基金

0+阅读 · 2013年12月31日

向量优化问题的近似解的最优性条件

国家自然科学基金

0+阅读 · 2012年12月31日

基于集合卡尔曼滤波的污染源反演方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

miR-663在肝癌细胞增殖中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

1-3型正交异性水泥基压电复合材料的性能及其机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

硅基锗锡合金材料外延生长机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

TRAIL在动脉粥样硬化发生发展中作用机制的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员