Efficient-FedRec: Efficient Federated Learning Framework for Privacy-Preserving News Recommendation

News recommendation is critical for personalized news access. Most existing news recommendation methods rely on centralized storage of users' historical news click behavior data, which may lead to privacy concerns and hazards. Federated Learning is a privacy-preserving framework for multiple clients to collaboratively train models without sharing their private data. However, the computation and communication cost of directly learning many existing news recommendation models in a federated way are unacceptable for user clients. In this paper, we propose an efficient federated learning framework for privacy-preserving news recommendation. Instead of training and communicating the whole model, we decompose the news recommendation model into a large news model maintained in the server and a light-weight user model shared on both server and clients, where news representations and user model are communicated between server and clients. More specifically, the clients request the user model and news representations from the server, and send their locally computed gradients to the server for aggregation. The server updates its global user model with the aggregated gradients, and further updates its news model to infer updated news representations. Since the local gradients may contain private information, we propose a secure aggregation method to aggregate gradients in a privacy-preserving way. Experiments on two real-world datasets show that our method can reduce the computation and communication cost on clients while keep promising model performance.

翻译：新闻推荐对于个性化新闻访问至关重要。现有大多数新闻推荐方法依赖于集中存储用户历史新闻点击行为数据，这可能引发隐私问题和风险。联邦学习是一种隐私保护框架，允许多个客户端在不共享私有数据的情况下协同训练模型。然而，直接以联邦方式学习许多现有新闻推荐模型所产生的计算和通信开销对用户客户端而言难以承受。本文提出一种面向隐私保护新闻推荐的高效联邦学习框架。不同于训练和通信整个模型，我们将新闻推荐模型分解为在服务器端维护的大型新闻模型和在服务器与客户端共享的轻量级用户模型，其中新闻表示和用户模型在服务器与客户端之间进行通信。具体而言，客户端从服务器请求用户模型和新闻表示，并将其本地计算的梯度发送至服务器进行聚合。服务器使用聚合后的梯度更新其全局用户模型，并进一步更新新闻模型以推断更新的新闻表示。由于本地梯度可能包含隐私信息，本文提出一种安全聚合方法，以隐私保护方式聚合梯度。在两个真实数据集上的实验表明，该方法能够在保证模型性能的前提下降低客户端的计算和通信开销。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

不可错过！杜克大学《因果推断》课程，全面讲述因果推理

专知会员服务

52+阅读 · 2022年10月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

105+阅读 · 2022年2月10日

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日