Mobile GUI agents powered by Multimodal Large Language Models (MLLMs) can execute complex tasks on mobile devices. Despite this progress, most existing systems still optimize task success or efficiency, neglecting users' privacy personalization. In this paper, we study the often-overlooked problem of agent personalization. We observe that personalization can induce systematic structural heterogeneity in execution trajectories. For example, privacy-first users often prefer protective actions, e.g., refusing permissions, logging out, and minimizing exposure, leading to logically different execution trajectories from utility-first users. Such variable-length and structurally different trajectories make standard preference optimization unstable and less informative. To address this issue, we propose Trajectory Induced Preference Optimization (TIPO), which uses preference-intensity weighting to emphasize key privacy-related steps and padding gating to suppress alignment noise. Results on our Privacy Preference Dataset show that TIPO improves persona alignment and distinction while preserving strong task executability, achieving 65.60% SR, 46.22 Compliance, and 66.67% PD, outperforming existing optimization methods across various GUI tasks. The code and dataset will be publicly released at https://github.com/Zhixin-L/TIPO.
翻译:由多模态大语言模型驱动的移动GUI代理能够在移动设备上执行复杂任务。尽管取得进展,现有系统仍主要优化任务成功率或效率,忽视了用户的隐私个性化需求。本文研究这一常被忽视的代理个性化问题。我们观察到个性化会引发生成轨迹的系统性结构异质性:例如,隐私优先型用户倾向于保护性操作(如拒绝权限、注销账户、最小化暴露),导致其执行轨迹在逻辑上与效用优先型用户存在差异。这种变长且结构不同的轨迹使得标准偏好优化不稳定且信息量不足。为解决该问题,我们提出轨迹诱导偏好优化方法,通过偏好强度加权突出关键隐私相关步骤,并利用填充门控机制抑制对齐噪声。在隐私偏好数据集上的实验表明,TIPO在保持强任务执行能力的同时提升了角色对齐性与区分度,在多项GUI任务中实现了65.60%的成功率、46.22的合规性及66.67%的隐私区分度,性能优于现有优化方法。代码与数据集将发布于https://github.com/Zhixin-L/TIPO。