The exponential growth of data collection necessitates robust privacy protections that preserve data utility. We address information disclosure against adversaries with bounded prior knowledge, modeled by an entropy constraint $H(X) \geq b$. Within this information privacy framework -- which replaces differential privacy's independence assumption with a bounded-knowledge model -- we study three core problems: maximal per-record leakage, the primal leakage-distortion tradeoff (minimizing worst-case leakage under distortion $D$), and the dual distortion minimization (minimizing distortion under leakage constraint $L$). These problems resemble classical information-theoretic ones (channel capacity, rate-distortion) but are more complex due to high dimensionality and the entropy constraint. We develop efficient alternating optimization algorithms that exploit convexity-concavity duality, with theoretical guarantees including local convergence for the primal problem and convergence to a stationary point for the dual. Experiments on binary symmetric channels and modular sum queries validate the algorithms, showing improved privacy-utility tradeoffs over classical differential privacy mechanisms. This work provides a computational framework for auditing privacy risks and designing certified mechanisms under realistic adversary assumptions.
翻译:数据收集的指数级增长需要既保留数据效用又具备鲁棒性的隐私保护措施。我们针对具有有界先验知识的对手(由熵约束 $H(X) \geq b$ 建模)研究信息披露问题。在此信息隐私框架内——该框架用有界知识模型替代差分隐私的独立性假设——我们研究了三个核心问题:最大逐记录泄漏、原始泄漏-失真权衡(在失真 $D$ 下最小化最坏情况泄漏)以及对偶失真最小化(在泄漏约束 $L$ 下最小化失真)。这些问题类似于经典信息论问题(信道容量、率失真),但由于高维度和熵约束而更为复杂。我们开发了利用凸-凹对偶性的高效交替优化算法,具有理论保证,包括原始问题的局部收敛性以及对偶问题收敛到驻点。在二进制对称信道和模和查询上的实验验证了这些算法,显示其相较于经典差分隐私机制改进了隐私-效用权衡。本工作为在现实对手假设下审计隐私风险并设计经过认证的机制提供了计算框架。