DP-SAPF: Saliency-Aware Parameter Fine-tuning of Public Models for Differentially Private Image Synthesis

Differentially private (DP) image synthesis generates images that preserve the statistical characteristics of a sensitive dataset, enabling analysis and use of sensitive data while providing rigorous guarantees against privacy leakage. Existing methods fine-tune public models on sensitive images with DP Stochastic Gradient Descent (DP-SGD) and then use them to generate synthetic images. However, fully fine-tuning public models on sensitive images is computationally expensive, because current public models typically contain a large number of parameters. Recent work heuristically applies Low-Rank Adaptation (LoRA) to all attention-layer parameters of public models to reduce the number of trainable parameters. We argue that exhaustive LoRA coverage of all attention-layer parameters is suboptimal in a DP setting, as it leads to noise accumulation and collapse during private training. To address this issue, we propose DP-SAPF, which uses a saliency-aware strategy to identify specific target parameters for LoRA training under DP. DP-SAPF builds on the observation that larger gradients signify higher saliency, indicating that the corresponding parameters are the most critical for DP learning. Specifically, we feed the sensitive images into public models, compute gradients, and add noise to the gradients to satisfy DP. DP-SAPF then selects the most salient parameters, those exhibiting high gradient magnitudes on sensitive images, for DP fine-tuning. Experiments on four sensitive image datasets show that DP-SAPF improves the utility and fidelity of synthetic images while requiring fewer computational resources than fine-tuning methods without parameter selection.
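The selection step described above (compute gradients on sensitive images, add noise, keep the parameters with the largest gradient magnitudes) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. The abstract does not specify the clipping rule, the scoring function, or the selection granularity, so the sketch assumes per-sample L2 clipping with the Gaussian mechanism, a root-mean-square score per module, and top-k selection over hypothetical module names such as `attn.q`. Under DP, the privacy cost of such a step would also need to be accounted for together with the later DP-SGD training.

```python
import math
import random


def clip_sample(grad, clip_norm):
    """Scale one sample's gradient so its global L2 norm is at most clip_norm."""
    total = math.sqrt(sum(g * g for vec in grad.values() for g in vec))
    scale = min(1.0, clip_norm / total) if total > 0 else 1.0
    return {name: [g * scale for g in vec] for name, vec in grad.items()}


def select_salient_modules(per_sample_grads, clip_norm, noise_multiplier, k, seed=0):
    """Return the k module names with the largest noisy gradient magnitude.

    per_sample_grads: list of dicts mapping a (hypothetical) module name to
    that sample's flattened gradient for the module.
    The RMS score is an assumption made for this sketch.
    """
    rng = random.Random(seed)
    names = list(per_sample_grads[0].keys())

    # Sum the clipped per-sample gradients, module by module.
    summed = {n: [0.0] * len(per_sample_grads[0][n]) for n in names}
    for grad in per_sample_grads:
        clipped = clip_sample(grad, clip_norm)
        for n in names:
            for i, g in enumerate(clipped[n]):
                summed[n][i] += g

    # Gaussian mechanism: noise standard deviation scales with the clipping norm.
    std = noise_multiplier * clip_norm
    scores = {}
    for n in names:
        noisy = [s + rng.gauss(0.0, std) for s in summed[n]]
        # RMS keeps modules of different sizes on a comparable scale.
        scores[n] = math.sqrt(sum(v * v for v in noisy) / len(noisy))

    ranked = sorted(names, key=lambda n: scores[n], reverse=True)
    return ranked[:k], scores
```

In a setup like this, LoRA adapters would be attached only to the returned modules, so DP-SGD noise is injected into fewer trainable parameters than with full attention-layer coverage.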
