A Probabilistic Fluctuation based Membership Inference Attack for Generative Models

Membership Inference Attack (MIA) identifies whether a record exists in a machine learning model's training set by querying the model. MIAs on the classic classification models have been well-studied, and recent works have started to explore how to transplant MIA onto generative models. Our investigation indicates that existing MIAs designed for generative models mainly depend on the overfitting in target models. However, overfitting can be avoided by employing various regularization techniques, whereas existing MIAs demonstrate poor performance in practice. Unlike overfitting, memorization is essential for deep learning models to attain optimal performance, making it a more prevalent phenomenon. Memorization in generative models leads to an increasing trend in the probability distribution of generating records around the member record. Therefore, we propose a Probabilistic Fluctuation Assessing Membership Inference Attack (PFAMI), a black-box MIA that infers memberships by detecting these trends via analyzing the overall probabilistic fluctuations around given records. We conduct extensive experiments across multiple generative models and datasets, which demonstrate PFAMI can improve the attack success rate (ASR) by about 27.9% when compared with the best baseline.

翻译：成员推断攻击（Membership Inference Attack, MIA）通过查询模型来判定某个记录是否存在于机器学习模型的训练集中。针对经典分类模型的MIA已被广泛研究，而近期工作开始探索如何将MIA移植至生成模型。我们的研究表明，现有针对生成模型的MIA主要依赖目标模型的过拟合现象。然而，过拟合可通过采用多种正则化技术加以避免，导致现有MIA在实际场景中表现欠佳。与过拟合不同，记忆化（memorization）是深度学习模型达到最优性能所必需的特性，使其成为更为普遍的现象。生成模型中的记忆化会导致成员记录周围的生成概率分布呈现递增趋势。基于此，我们提出基于概率波动评估的成员推断攻击（Probabilistic Fluctuation Assessing Membership Inference Attack, PFAMI），这是一种黑盒MIA方法，通过分析给定记录周围的整体概率波动来检测这些趋势，进而推断成员关系。我们在多种生成模型和数据集上进行了广泛实验，结果表明，与最优基线方法相比，PFAMI可将攻击成功率（Attack Success Rate, ASR）提升约27.9%。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日