Beyond Retrieval: Learning Compact User Representations for Scalable LLM Personalization

Personalizing large language models requires adapting model behavior to individual users while preserving robustness and deployment-scale efficiency. Existing approaches typically personalize LLMs either at the input level, by retrieving user histories or constructing profile prompts, or at the parameter level, by maintaining user-specific parameter-efficient modules. The former makes personalization sensitive to retrieval quality and prompt design, whereas the latter incurs storage and maintenance costs that grow with the user population. To address these limitations, we propose TAP-PER (Temporal Attentive Prefix for PERsonalization), a prefix-based framework that encodes user preferences as learnable representations, eliminating explicit prompt construction and replacing heavy per-user adapters with lightweight user-state prefix embeddings. Inspired by personalized recommendation systems, TAP-PER decomposes user modeling into user-state and query-conditioned components, and incorporates temporal signals to capture the evolving nature of user interests. Experiments on six LaMP tasks show that TAP-PER consistently outperforms prompt-based and model-based baselines across classification, rating, and generation settings. Moreover, TAP-PER uses 130x fewer per-user parameters than OPPU and roughly half the total parameter footprint of PER-PCS at the 1,000-user scale, demonstrating that scalable LLM personalization can be achieved without explicit prompt construction or heavy per-user adapters.

翻译：个性化大语言模型需要根据个体用户调整模型行为，同时保持鲁棒性和部署规模的效率。现有方法通常通过检索用户历史或构建用户画像提示词在输入层面进行个性化，或通过维护用户特定的参数高效模块在参数层面实现个性化。前者让个性化效果对检索质量和提示词设计敏感，而后者则面临随用户规模增长而增加的存储与维护成本。为解决这些局限，我们提出TAP-PER（时序关注前缀个性化框架），一种基于前缀的框架，通过将用户偏好编码为可学习表示，免除了显式的提示词构建，并将繁重的各用户适配器替换为轻量级用户状态前缀嵌入。受个性化推荐系统启发，TAP-PER将用户建模分解为用户状态与查询条件两项成分，并融入时序信号以捕捉用户兴趣的演化特性。在六项LaMP任务上的实验表明，TAP-PER在分类、评分和生成场景下均持续优于基于提示词和基于模型的基线方法。此外，在千用户规模下，TAP-PER的每用户参数数仅为OPPU的1/130，总参数量约为PER-PCS的一半，证明无需显式提示词构建或繁重的个性化适配器即可实现可扩展的大语言模型个性化。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

迈向个性化大语言模型驱动的智能体：基础、评估与未来方向

专知会员服务

29+阅读 · 2月27日

【WWW2025】G-Refer：基于图检索增强的大型语言模型用于可解释推荐

专知会员服务

13+阅读 · 2025年4月8日

带入您自己的知识：大型语言模型（LLM）知识扩展方法综述

专知会员服务

38+阅读 · 2025年2月21日

个性化大型语言模型综述：进展与未来方向

专知会员服务

43+阅读 · 2025年2月18日