Privacy in Cloud Computing through Immersion-based Coding

Cloud computing enables users to process and store data remotely on high-performance computers and servers by sharing data over the Internet. However, transferring data to clouds causes unavoidable privacy concerns. Here, we present a synthesis framework to design coding mechanisms that allow sharing and processing data in a privacy-preserving manner without sacrificing data utility and algorithmic performance. We consider the setup where the user aims to run an algorithm in the cloud using private data. The cloud then returns some data utility back to the user (utility refers to the service that the algorithm provides, e.g., classification, prediction, AI models, etc.). To avoid privacy concerns, the proposed scheme provides tools to co-design: 1) coding mechanisms to distort the original data and guarantee a prescribed differential privacy level; 2) an equivalent-but-different algorithm (referred here to as the target algorithm) that runs on distorted data and produces distorted utility; and 3) a decoding function that extracts the true utility from the distorted one with a negligible error. Then, instead of sharing the original data and algorithm with the cloud, only the distorted data and target algorithm are disclosed, thereby avoiding privacy concerns. The proposed scheme is built on the synergy of differential privacy and system immersion tools from control theory. The key underlying idea is to design a higher-dimensional target algorithm that embeds all trajectories of the original algorithm and works on randomly encoded data to produce randomly encoded utility. We show that the proposed scheme can be designed to offer any level of differential privacy without degrading the algorithm's utility. We present two use cases to illustrate the performance of the developed tools: privacy in optimization/learning algorithms and a nonlinear networked control system.

翻译：云计算通过互联网共享数据，使用户能够将数据远程处理并存储在高性能计算机和服务器上。然而，向云端传输数据会引发不可避免的隐私问题。本文提出一种综合框架，用于设计既能保障隐私共享与处理数据、又不牺牲数据效用和算法性能的编码机制。我们考虑用户希望在云端使用私有数据运行算法的场景，云端随后向用户返回某些数据效用（效用指算法提供的服务，例如分类、预测、人工智能模型等）。为避免隐私问题，所提出的方案提供了联合设计工具：1）用于扭曲原始数据并保证规定差分隐私水平的编码机制；2）在扭曲数据上运行并产生扭曲效用的等价异形算法（此处称为目标算法）；3）从扭曲效用中提取真实效用且误差可忽略的解码函数。这样，用户只需向云端披露扭曲数据和目标算法，而非原始数据和算法，从而规避隐私问题。该方案基于差分隐私与控制理论中系统浸入方法的协同作用构建。其核心思想是设计一个更高维的目标算法，该算法嵌入原始算法的所有轨迹，并在随机编码数据上运行以产生随机编码效用。我们证明，该方案可在不降低算法效用的前提下实现任意水平的差分隐私保护。为展示所开发工具的性能，本文给出两个应用案例：优化/学习算法中的隐私保护以及非线性网络化控制系统。

相关内容

TOOLS

关注 1

这个新版本的工具会议系列恢复了从1989年到2012年的50个会议的传统。工具最初是“面向对象语言和系统的技术”，后来发展到包括软件技术的所有创新方面。今天许多最重要的软件概念都是在这里首次引入的。2019年TOOLS 50+1在俄罗斯喀山附近举行，以同样的创新精神、对所有与软件相关的事物的热情、科学稳健性和行业适用性的结合以及欢迎该领域所有趋势和社区的开放态度，延续了该系列。官网链接：http://tools2019.innopolis.ru/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日