Beyond Retrieval: Learning Compact User Representations for Scalable LLM Personalization

Personalizing large language models requires adapting model behavior to individual users while preserving robustness and deployment-scale efficiency. Existing approaches typically personalize LLMs either at the input level, by retrieving user histories or constructing profile prompts, or at the parameter level, by maintaining user-specific parameter-efficient modules. The former makes personalization sensitive to retrieval quality and prompt design, whereas the latter incurs storage and maintenance costs that grow with the user population. To address these limitations, we propose TAP-PER (Temporal Attentive Prefix for PERsonalization), a prefix-based framework that encodes user preferences as learnable representations, eliminating explicit prompt construction and replacing heavy per-user adapters with lightweight user-state prefix embeddings. Inspired by personalized recommendation systems, TAP-PER decomposes user modeling into user-state and query-conditioned components, and incorporates temporal signals to capture the evolving nature of user interests. Experiments on six LaMP tasks show that TAP-PER consistently outperforms prompt-based and model-based baselines across classification, rating, and generation settings. Moreover, TAP-PER uses 130x fewer per-user parameters than OPPU and roughly half the total parameter footprint of PER-PCS at the 1,000-user scale, demonstrating that scalable LLM personalization can be achieved without explicit prompt construction or heavy per-user adapters.

翻译：个性化大语言模型需要在保持鲁棒性和部署效率的同时，将模型行为适配至个体用户。现有方法通常通过两种途径实现LLM个性化：在输入层面，检索用户历史或构建用户画像提示；或在参数层面，维护用户特定的参数高效模块。前者使个性化效果受限于检索质量与提示设计，后者则需承担随用户规模增长而增加的内存与维护成本。为突破上述限制，我们提出TAP-PER（时序注意力前缀个性化框架），这是一种基于前缀的架构，通过可学习表示编码用户偏好，无需显式构建提示，并以轻量级用户状态前缀嵌入替代繁重的逐用户适配器。受个性化推荐系统启发，TAP-PER将用户建模分解为用户状态与查询条件化组件，并引入时序信号以捕捉用户兴趣的演化特性。在六个LaMP任务上的实验表明，TAP-PER在分类、评分与生成三类场景中均持续优于基于提示和基于模型的基线方法。此外，在1000用户规模下，TAP-PER的每用户参数仅为OPPU的1/130，总参数量约为PER-PCS的一半，验证了无需显式提示构建或繁重逐用户适配器即可实现可扩展LLM个性化的可行性。

相关内容

TAP

关注 819

ACM应用感知TAP(ACM Transactions on Applied Perception)旨在通过发表有助于统一这些领域研究的高质量论文来增强计算机科学与心理学/感知之间的协同作用。该期刊发表跨学科研究，在跨计算机科学和感知心理学的任何主题领域都具有重大而持久的价值。所有论文都必须包含感知和计算机科学两个部分。主题包括但不限于：视觉感知：计算机图形学，科学/数据/信息可视化，数字成像，计算机视觉，立体和3D显示技术。听觉感知：听觉显示和界面，听觉听觉编码，空间声音，语音合成和识别。触觉：触觉渲染，触觉输入和感知。感觉运动知觉：手势输入，身体运动输入。感官感知：感官整合，多模式渲染和交互。官网地址：http://dblp.uni-trier.de/db/journals/tap/

迈向个性化大语言模型驱动的智能体：基础、评估与未来方向

专知会员服务

29+阅读 · 2月27日

【新书】设计大型语言模型应用：一种面向LLMs的整体方法

专知会员服务

56+阅读 · 2025年3月16日

带入您自己的知识：大型语言模型（LLM）知识扩展方法综述

专知会员服务

38+阅读 · 2025年2月21日

个性化大型语言模型综述：进展与未来方向

专知会员服务

43+阅读 · 2025年2月18日