As some recent information security legislation endowed users with unconditional rights to be forgotten by any trained machine learning model, personalized IoT service providers have to put unlearning functionality into their consideration. The most straightforward method to unlearn users' contribution is to retrain the model from the initial state, which is not realistic in high throughput applications with frequent unlearning requests. Though some machine unlearning frameworks have been proposed to speed up the retraining process, they fail to match decentralized learning scenarios. In this paper, we design a decentralized unlearning framework called HDUS, which uses distilled seed models to construct erasable ensembles for all clients. Moreover, the framework is compatible with heterogeneous on-device models, representing stronger scalability in real-world applications. Extensive experiments on three real-world datasets show that our HDUS achieves state-of-the-art performance.
翻译:随着近期信息安全法规赋予用户对任何已训练机器学习模型的无条件被遗忘权,个性化物联网服务提供商必须将遗忘功能纳入考量。最直接的遗忘用户贡献的方法是重新从初始状态训练模型,但这在高吞吐量且频繁出现遗忘请求的应用中并不现实。尽管已有一些机器遗忘框架被提出以加速重新训练过程,但它们无法适用于去中心化学习场景。本文设计了一种名为HDUS的去中心化遗忘框架,该框架利用蒸馏种子模型为所有客户端构建可擦除集成。此外,该框架兼容异构设备端模型,在现实应用中展现出更强的可扩展性。在三个真实数据集上的大量实验表明,我们的HDUS达到了最先进的性能。