The Bloom filter is a simple yet space-efficient probabilistic data structure that supports membership queries for dramatically large datasets. It is widely utilized and implemented across various industrial scenarios, often handling massive datasets that include sensitive user information necessitating privacy preservation. To address the challenge of maintaining privacy within the Bloom filter, we have developed the DPBloomfilter. This innovation integrates the classical differential privacy mechanism, specifically the Random Response technique, into the Bloom filter, offering robust privacy guarantees under the same running complexity as the standard Bloom filter. Through rigorous simulation experiments, we have demonstrated that our DPBloomfilter algorithm maintains high utility while ensuring privacy protections. To the best of our knowledge, this is the first work to provide differential privacy guarantees for the Bloom filter for membership query problems.
翻译:布隆过滤器是一种简洁高效的概率数据结构,能够支持超大规模数据集上的成员查询操作。该结构已在众多工业场景中得到广泛应用与实现,通常处理包含敏感用户信息且需隐私保护的海量数据集。为解决布隆过滤器中的隐私保护难题,我们提出了DPBloomfilter。该创新方案将经典差分隐私机制——特别是随机响应技术——集成至布隆过滤器中,在保持与标准布隆过滤器相同时间复杂度的同时提供强隐私保障。通过严格的仿真实验,我们证明DPBloomfilter算法在确保隐私保护的同时仍保持较高的数据效用。据我们所知,本研究首次为布隆过滤器在成员查询问题上提供了差分隐私保障。