Private Estimation with Public Data - 专知论文

会员服务 ·

0

样本复杂度 · 高斯分布 · 样本 · 数据分布 · 高斯混合 ·

2023 年 4 月 6 日

Private Estimation with Public Data

翻译：利用公共数据进行私有估计

Alex Bie,Gautam Kamath,Vikrant Singhal

from arxiv, 55 pages; updated funding acknowledgement + simulation results from NeurIPS 2022 camera-ready

We initiate the study of differentially private (DP) estimation with access to a small amount of public data. For private estimation of d-dimensional Gaussians, we assume that the public data comes from a Gaussian that may have vanishing similarity in total variation distance with the underlying Gaussian of the private data. We show that under the constraints of pure or concentrated DP, d+1 public data samples are sufficient to remove any dependence on the range parameters of the private data distribution from the private sample complexity, which is known to be otherwise necessary without public data. For separated Gaussian mixtures, we assume that the underlying public and private distributions are the same, and we consider two settings: (1) when given a dimension-independent amount of public data, the private sample complexity can be improved polynomially in terms of the number of mixture components, and any dependence on the range parameters of the distribution can be removed in the approximate DP case; (2) when given an amount of public data linear in the dimension, the private sample complexity can be made independent of range parameters even under concentrated DP, and additional improvements can be made to the overall sample complexity.

翻译：我们研究了在拥有少量公共数据的情况下进行差分私有（DP）估计的问题。对于d维高斯分布的私有估计，我们假设公共数据来自一个高斯分布，该分布与私有数据背后的高斯分布在总变差距离上可能具有衰减的相似性。我们证明，在纯DP或集中DP约束下，d+1个公共数据样本足以消除私有数据分布的范围参数对私有样本复杂度的任何依赖，而对于没有公共数据的情况，这种依赖已知是必要的。对于分离的高斯混合模型，我们假设背后的公共和私有分布相同，并考虑两种设定：（1）当给定与维度无关数量的公共数据时，私有样本复杂度可以在混合成分数量上得到多项式改进，且在近似DP情况下，可以消除对分布范围参数的任何依赖；（2）当给定与维度呈线性关系的公共数据量时，即使在集中DP下，私有样本复杂度也可独立于范围参数，并且总体样本复杂度可以进一步得到改进。

0

相关内容

样本复杂度

样本复杂度

【2023新书】实用数据隐私:增强数据的隐私性和安全性，599页pdf

【2023新书】实用数据隐私:增强数据的隐私性和安全性，599页pdf

专知会员服务

83+阅读 · 2023年5月1日

香港浸会大学最新《标签噪声表示学习》综述论文，全面阐述LNRL的数据、目标函数与优化策略

香港浸会大学最新《标签噪声表示学习》综述论文，全面阐述LNRL的数据、目标函数与优化策略

专知会员服务

32+阅读 · 2022年2月15日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

【ICML2020-DeepMind】小数据，大决策:小数据模式下的模型选择

专知会员服务

37+阅读 · 2020年9月14日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

专知会员服务

81+阅读 · 2020年5月20日

【SIGGRAPH 2020】人像阴影处理，Portrait Shadow Manipulation

【SIGGRAPH 2020】人像阴影处理，Portrait Shadow Manipulation

专知会员服务

29+阅读 · 2020年5月19日

Uber AI NeurIPS 2019《元学习meta-learning》教程，附92页PPT下载

Uber AI NeurIPS 2019《元学习meta-learning》教程，附92页PPT下载

专知会员服务

113+阅读 · 2019年12月13日

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

专知会员服务

36+阅读 · 2019年11月15日

从NeurIPS 2022看域泛化：大规模实验分析和模型平均

从NeurIPS 2022看域泛化：大规模实验分析和模型平均

PaperWeekly

0+阅读 · 2022年10月23日

特征筛选还在用XGB的Feature Importance？试试Permutation Importance

特征筛选还在用XGB的Feature Importance？试试Permutation Importance

PaperWeekly

0+阅读 · 2022年9月30日

【NeurIPS 2020 Tutorial】离线强化学习:从算法到挑战，80页ppt

【NeurIPS 2020 Tutorial】离线强化学习:从算法到挑战，80页ppt

专知

16+阅读 · 2020年12月9日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

BAT机器学习面试题1000题（316~320题）

BAT机器学习面试题1000题（316~320题）

七月在线实验室

14+阅读 · 2018年1月18日

树上生灭过程收敛速度及p-Laplacian特征值估计

国家自然科学基金

0+阅读 · 2015年12月31日

反应扩散方程中时滞引发的不稳定性和Hopf分支

国家自然科学基金

0+阅读 · 2013年12月31日

无线传感器网络中功率受限的分布式矢量估计

国家自然科学基金

0+阅读 · 2013年12月31日

删失数据中位数回归模型的统计分析

国家自然科学基金

3+阅读 · 2012年12月31日

云计算环境下大数据本地化技术研究

国家自然科学基金

4+阅读 · 2012年12月31日

连通与设施选址问题的近似算法研究

国家自然科学基金

2+阅读 · 2012年12月31日

基于Bregman距离的一致性风险测度及其应用

国家自然科学基金

0+阅读 · 2011年12月31日

广义Kloosterman和的均值估计

国家自然科学基金

1+阅读 · 2011年12月31日

Rayleigh信道统计分析和建模

国家自然科学基金

0+阅读 · 2009年12月31日

区间删失数据下竞争风险模型研究

国家自然科学基金

0+阅读 · 2008年12月31日

Fair Differentially Private Federated Learning Framework

Arxiv

0+阅读 · 2023年5月23日

Private Statistical Estimation of Many Quantiles

Arxiv

0+阅读 · 2023年5月23日

Multiply robust estimation for causal survival analysis with treatment noncompliance

Arxiv

0+阅读 · 2023年5月22日

TPMDP: Threshold Personalized Multi-party Differential Privacy via Optimal Gaussian Mechanism

Arxiv

0+阅读 · 2023年5月22日

Privacy-Preserving Taxi-Demand Prediction Using Federated Learning

Arxiv

0+阅读 · 2023年5月21日

Improved Differentially Private Regression via Gradient Boosting

Arxiv

0+阅读 · 2023年5月20日

Effect Size Estimation in Linear Mixed Models

Arxiv

0+阅读 · 2023年5月20日

Off-policy evaluation beyond overlap: partial identification through smoothness

Arxiv

0+阅读 · 2023年5月19日

Efficient and Deterministic Search Strategy Based on Residual Projections for Point Cloud Registration

Arxiv

0+阅读 · 2023年5月19日

Meta-Learning with Implicit Gradients

Meta-Learning with Implicit Gradients

Arxiv

13+阅读 · 2019年9月10日

VIP会员

文章信息

相关主题

样本复杂度

最新内容

《无人机对海面作战影响评估》

《无人机对海面作战影响评估》

专知会员服务

9+阅读 · 7月21日

《可损耗无人系统规模化应用对美国军事转型的战略影响（2022-2030）》2026年270页

《可损耗无人系统规模化应用对美国军事转型的战略影响（2022-2030）》2026年270页

专知会员服务

8+阅读 · 7月21日

博士论文 | 后训练如何损害大模型生成多样性？SimpleStrat与Stylus

博士论文 | 后训练如何损害大模型生成多样性？SimpleStrat与Stylus

专知会员服务

3+阅读 · 7月21日

综述 | 面向5G/6G网络的LLM智能体AI：架构、协议与标准化

综述 | 面向5G/6G网络的LLM智能体AI：架构、协议与标准化

专知会员服务

5+阅读 · 7月21日

五角大楼新设无人机办公室（DRPM-UxS）将如何重塑美国无人系统格局（附美国防部设立备忘录）

五角大楼新设无人机办公室（DRPM-UxS）将如何重塑美国无人系统格局（附美国防部设立备忘录）

专知会员服务

6+阅读 · 7月21日

印度精确打击与指挥架构的断层

印度精确打击与指挥架构的断层

专知会员服务

5+阅读 · 7月20日

《NASA喷气推进实验室：高耐久轻质常驻空观测系统（HELIOS）》429页

《NASA喷气推进实验室：高耐久轻质常驻空观测系统（HELIOS）》429页

专知会员服务

7+阅读 · 7月20日

美空军AI完成F-16战斗机自主空战历史性试飞

美空军AI完成F-16战斗机自主空战历史性试飞

专知会员服务

6+阅读 · 7月20日

《美政府问责局——武器系统年度评估（2026年）：强制要求成熟技术或可推动转向快速交付》249页

《美政府问责局——武器系统年度评估（2026年）：强制要求成熟技术或可推动转向快速交付》249页

专知会员服务

8+阅读 · 7月20日

《美国陆军：通过弹性分布式模型库实现自适应AI优势》

《美国陆军：通过弹性分布式模型库实现自适应AI优势》

专知会员服务

7+阅读 · 7月20日

博士论文 | 理解与改进大语言模型推理：从反转诅咒到连续思维链

博士论文 | 理解与改进大语言模型推理：从反转诅咒到连续思维链

专知会员服务

9+阅读 · 7月20日

综述 | 终身视觉表征：持续自监督学习CSSL系统综述

综述 | 终身视觉表征：持续自监督学习CSSL系统综述

专知会员服务

9+阅读 · 7月20日

深入Project Maven：为何人工智能在战场上依然失灵

深入Project Maven：为何人工智能在战场上依然失灵

专知会员服务

15+阅读 · 7月19日

锻造未来士兵：外骨骼、基因工程与赛博格

锻造未来士兵：外骨骼、基因工程与赛博格

专知会员服务

8+阅读 · 7月19日

《无人机系统（UAS）通信网状网络试验性部署》50页报告

《无人机系统（UAS）通信网状网络试验性部署》50页报告

专知会员服务

10+阅读 · 7月19日

相关VIP内容

【2023新书】实用数据隐私:增强数据的隐私性和安全性，599页pdf

【2023新书】实用数据隐私:增强数据的隐私性和安全性，599页pdf

专知会员服务

83+阅读 · 2023年5月1日

香港浸会大学最新《标签噪声表示学习》综述论文，全面阐述LNRL的数据、目标函数与优化策略

香港浸会大学最新《标签噪声表示学习》综述论文，全面阐述LNRL的数据、目标函数与优化策略

专知会员服务

32+阅读 · 2022年2月15日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

52+阅读 · 2020年12月14日

【ICML2020-DeepMind】小数据，大决策:小数据模式下的模型选择

专知会员服务

37+阅读 · 2020年9月14日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

专知会员服务

81+阅读 · 2020年5月20日

【SIGGRAPH 2020】人像阴影处理，Portrait Shadow Manipulation

【SIGGRAPH 2020】人像阴影处理，Portrait Shadow Manipulation

专知会员服务

29+阅读 · 2020年5月19日

Uber AI NeurIPS 2019《元学习meta-learning》教程，附92页PPT下载

Uber AI NeurIPS 2019《元学习meta-learning》教程，附92页PPT下载

专知会员服务

113+阅读 · 2019年12月13日

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

专知会员服务

36+阅读 · 2019年11月15日

热门VIP内容

开通专知VIP会员享更多权益服务

《可损耗无人系统规模化应用对美国军事转型的战略影响（2022-2030）》2026年270页

综述 | 面向5G/6G网络的LLM智能体AI：架构、协议与标准化

《无人机对海面作战影响评估》

博士论文 | 后训练如何损害大模型生成多样性？SimpleStrat与Stylus

相关资讯

从NeurIPS 2022看域泛化：大规模实验分析和模型平均

从NeurIPS 2022看域泛化：大规模实验分析和模型平均

PaperWeekly

0+阅读 · 2022年10月23日

特征筛选还在用XGB的Feature Importance？试试Permutation Importance

特征筛选还在用XGB的Feature Importance？试试Permutation Importance

PaperWeekly

0+阅读 · 2022年9月30日

【NeurIPS 2020 Tutorial】离线强化学习:从算法到挑战，80页ppt

【NeurIPS 2020 Tutorial】离线强化学习:从算法到挑战，80页ppt

专知

16+阅读 · 2020年12月9日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

BAT机器学习面试题1000题（316~320题）

BAT机器学习面试题1000题（316~320题）

七月在线实验室

14+阅读 · 2018年1月18日

相关论文

Fair Differentially Private Federated Learning Framework

Arxiv

0+阅读 · 2023年5月23日

Private Statistical Estimation of Many Quantiles

Arxiv

0+阅读 · 2023年5月23日

Multiply robust estimation for causal survival analysis with treatment noncompliance

Arxiv

0+阅读 · 2023年5月22日

TPMDP: Threshold Personalized Multi-party Differential Privacy via Optimal Gaussian Mechanism

Arxiv

0+阅读 · 2023年5月22日

Privacy-Preserving Taxi-Demand Prediction Using Federated Learning

Arxiv

0+阅读 · 2023年5月21日

Improved Differentially Private Regression via Gradient Boosting

Arxiv

0+阅读 · 2023年5月20日

Effect Size Estimation in Linear Mixed Models

Arxiv

0+阅读 · 2023年5月20日

Off-policy evaluation beyond overlap: partial identification through smoothness

Arxiv

0+阅读 · 2023年5月19日

Efficient and Deterministic Search Strategy Based on Residual Projections for Point Cloud Registration

Arxiv

0+阅读 · 2023年5月19日

Meta-Learning with Implicit Gradients

Meta-Learning with Implicit Gradients

Arxiv

13+阅读 · 2019年9月10日

相关基金

树上生灭过程收敛速度及p-Laplacian特征值估计

国家自然科学基金

0+阅读 · 2015年12月31日

反应扩散方程中时滞引发的不稳定性和Hopf分支

国家自然科学基金

0+阅读 · 2013年12月31日

无线传感器网络中功率受限的分布式矢量估计

国家自然科学基金

0+阅读 · 2013年12月31日

删失数据中位数回归模型的统计分析

国家自然科学基金

3+阅读 · 2012年12月31日

云计算环境下大数据本地化技术研究

国家自然科学基金

4+阅读 · 2012年12月31日

连通与设施选址问题的近似算法研究

国家自然科学基金

2+阅读 · 2012年12月31日

基于Bregman距离的一致性风险测度及其应用

国家自然科学基金

0+阅读 · 2011年12月31日

广义Kloosterman和的均值估计

国家自然科学基金

1+阅读 · 2011年12月31日

Rayleigh信道统计分析和建模

国家自然科学基金

0+阅读 · 2009年12月31日

区间删失数据下竞争风险模型研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员