TransFR: Transferable Federated Recommendation with Pre-trained Language Models

Federated recommendations (FRs), facilitating multiple local clients to collectively learn a global model without disclosing user private data, have emerged as a prevalent architecture for privacy-preserving recommendations. In conventional FRs, a dominant paradigm is to utilize discrete identities to represent users/clients and items, which are subsequently mapped to domain-specific embeddings to participate in model training. Despite considerable performance, we reveal three inherent limitations that can not be ignored in federated settings, i.e., non-transferability across domains, unavailability in cold-start settings, and potential privacy violations during federated training. To this end, we propose a transferable federated recommendation model with universal textual representations, TransFR, which delicately incorporates the general capabilities empowered by pre-trained language models and the personalized abilities by fine-tuning local private data. Specifically, it first learns domain-agnostic representations of items by exploiting pre-trained models with public textual corpora. To tailor for federated recommendation, we further introduce an efficient federated fine-tuning and a local training mechanism. This facilitates personalized local heads for each client by utilizing their private behavior data. By incorporating pre-training and fine-tuning within FRs, it greatly improves the adaptation efficiency transferring to a new domain and the generalization capacity to address cold-start issues. Through extensive experiments on several datasets, we demonstrate that our TransFR model surpasses several state-of-the-art FRs in terms of accuracy, transferability, and privacy.

翻译：联邦推荐（FRs）允许多个本地客户端协同学习全局模型而不泄露用户隐私数据，已成为隐私保护推荐领域的流行架构。传统FRs的主要范式是利用离散身份标识表示用户/客户端和物品，并将其映射为领域特定的嵌入向量参与模型训练。尽管性能显著，我们发现联邦场景中存在三个固有局限性：跨领域不可迁移性、冷启动场景不可用性及联邦训练中潜在的隐私泄露风险。为此，我们提出基于通用文本表示的可迁移联邦推荐模型TransFR，该模型巧妙融合预训练语言模型赋予的通用能力与本地私有数据微调获得的个性化能力。具体而言，它首先利用公开文本语料库的预训练模型学习物品的领域无关表示。为适配联邦推荐场景，我们进一步引入高效联邦微调和本地训练机制，使各客户端能利用私有行为数据实现个性化本地头部网络。通过将预训练与微调融入FRs，该方法显著提升了迁移至新领域的适应效率与解决冷启动问题的泛化能力。多数据集上的大量实验表明，我们的TransFR模型在准确性、可迁移性和隐私保护方面均优于多个现有最优联邦推荐模型。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日