Task-Aware Automated User Profile Generation for Recommendation Simulation Using Large Language Models

Large Language Model (LLM)-based agent simulation has emerged as a promising approach to meet the increasing demand for real-time and rigorous evaluation in modern recommender systems. A typical LLM-driven simulation framework comprises three essential components: the profile module, memory module, and action module. However, existing studies have primarily concentrated on enhancing the memory and action modules, with limited attention to profile generation, which plays a pivotal role in ensuring realistic agent behaviours and aligning simulated interactions with real user dynamics. Moreover, the scarcity of datasets specifically designed for recommendation simulations has led to heavy reliance on manually crafted profiles, significantly limiting the scalability and generalisability of simulation frameworks across different datasets. To address these challenges, this work proposes an Automated Profile Generation Framework for Recommendation Simulation, APG4RecSim, that constructs realistic, coherent, and robust user profiles with minimal supervision. Extensive experiments on three benchmark datasets demonstrate that APG4RecSim achieves the best overall performance on discrimination, ranking, and rating tasks, improving ranking quality by up to 7% in nDCG@10 and reducing rating distribution divergence by 8% in JSD compared to existing profile-generation baselines. Beyond overall performance gains, our results show that profiles generated by APG4RecSim are resilient to popularity- and position-induced biases and maintain stable performance across datasets and different LLMs.

翻译：基于大语言模型（LLM）的智能体模拟已成为满足现代推荐系统对实时化、严格化评估需求的前沿方法。典型的LLM驱动模拟框架包含三个核心模块：画像模块、记忆模块和动作模块。然而，现有研究主要集中于增强记忆与动作模块，对作为确保智能体行为真实性、使模拟交互贴近真实用户动态关键因素的画像生成关注有限。此外，专门用于推荐模拟的数据集匮乏导致严重依赖人工构建画像，显著限制了模拟框架在不同数据集间的可扩展性和泛化能力。针对上述挑战，本文提出面向推荐模拟的自动化画像生成框架APG4RecSim，该框架能以极低监督成本构建真实、连贯且鲁棒的用户画像。在三个基准数据集上的大量实验表明，APG4RecSim在判别、排序和评分任务中均取得最佳整体性能，与现有画像生成基线相比，在nDCG@10指标上提升排序质量达7%，在JSD指标上降低评分分布差异达8%。除整体性能提升外，实验结果表明，APG4RecSim生成的画像对流行度偏差和位置偏差具有鲁棒性，且能在不同数据集与不同LLM间保持稳定性能。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

大语言模型智能体中的外显化机制：记忆、技能、协议与评测基准工程综述

专知会员服务

35+阅读 · 4月19日

迈向个性化大语言模型驱动的智能体：基础、评估与未来方向

专知会员服务

29+阅读 · 2月27日

基于大语言模型智能体的社会认知模拟

专知会员服务

19+阅读 · 2月22日

【AAAI2026】AutoTool：面向大语言模型智能体的高效工具选择方法

专知会员服务

19+阅读 · 2025年11月19日