On the Evaluation of User Privacy in Deep Neural Networks using Timing Side Channel

Recent Deep Learning (DL) advancements in solving complex real-world tasks have led to its widespread adoption in practical applications. However, this opportunity comes with significant underlying risks, as many of these models rely on privacy-sensitive data for training in a variety of applications, making them an overly-exposed threat surface for privacy violations. Furthermore, the widespread use of cloud-based Machine-Learning-as-a-Service (MLaaS) for its robust infrastructure support has broadened the threat surface to include a variety of remote side-channel attacks. In this paper, we first identify and report a novel data-dependent timing side-channel leakage (termed Class Leakage) in DL implementations originating from non-constant time branching operation in a widely used DL framework PyTorch. We further demonstrate a practical inference-time attack where an adversary with user privilege and hard-label black-box access to an MLaaS can exploit Class Leakage to compromise the privacy of MLaaS users. DL models are vulnerable to Membership Inference Attack (MIA), where an adversary's objective is to deduce whether any particular data has been used while training the model. In this paper, as a separate case study, we demonstrate that a DL model secured with differential privacy (a popular countermeasure against MIA) is still vulnerable to MIA against an adversary exploiting Class Leakage. We develop an easy-to-implement countermeasure by making a constant-time branching operation that alleviates the Class Leakage and also aids in mitigating MIA. We have chosen two standard benchmarking image classification datasets, CIFAR-10 and CIFAR-100 to train five state-of-the-art pre-trained DL models, over two different computing environments having Intel Xeon and Intel i7 processors to validate our approach.

翻译：近期深度学习在解决复杂现实任务方面取得的进展已使其广泛应用于实际场景中。然而，这一机遇伴随着显著的内在风险——许多模型在各类应用中依赖隐私敏感数据进行训练，导致其成为隐私泄露的过度暴露威胁面。此外，基于云的机器学习即服务（MLaaS）因其强大的基础设施支持而广泛普及，进一步拓宽了威胁面，使攻击面涵盖多种远程侧信道攻击。本文首先发现并报告了深度学习实现中存在的一种新型数据依赖性时间侧信道泄漏（称为类别泄漏），其根源在于广泛使用的深度学习框架PyTorch中非常量时间分支操作。我们进一步展示了一种实用的推理时攻击：具备用户权限且仅拥有模型硬标签黑盒访问权限的攻击者能够利用类别泄漏危害MLaaS用户的隐私。深度学习模型易受成员推理攻击（MIA），此类攻击中攻击者的目标是推断特定数据是否被用于模型训练。本文作为独立案例研究，证明即便采用差分隐私（一种流行的MIA防御措施）保护的深度学习模型，在面对利用类别泄漏的对手时仍易受MIA攻击。我们通过构建常量时间分支操作开发了易于实现的防御措施，该措施既能缓解类别泄漏，也有助于减轻MIA威胁。我们选用CIFAR-10和CIFAR-100两个标准图像分类基准数据集，在配备英特尔至强和酷睿i7处理器的两种计算环境下训练五种前沿预训练深度学习模型，以验证我们的方法。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日