PrivScope: Task-scoped Disclosure Control for Hybrid Agentic Systems

Hybrid local--cloud agents enrich user requests with context from persistent working state before delegating capability-intensive subtasks to a cloud language model (CLM). While this enrichment can improve task success, it also exposes unnecessary information in the cloud-bound payload, including task-irrelevant context, carryover from prior workflows, and overly specific sensitive details, resulting in \emph{over-disclosure}. Existing solutions either isolate workflows to limit cross-workflow leakage or apply general-purpose sanitization that does not reason over LC-assembled payload scope. We present \textsc{PrivScope}, a trusted on-device payload governor that enforces \emph{task-scoped disclosure} at the local--CLM boundary, without requiring cloud-side changes. Its key idea: sensitive information should reach the cloud only when required for the delegated subtask, and then only in the least revealing form preserving utility. \textsc{PrivScope} extracts disclosure units from the assembled payload and keeps direct identifiers and account-linked values on device. The remaining units pass through cloud-necessity control, which determines what is actually needed; units that must reach the cloud are abstracted to the least-specific representation sufficient for the task. On 100 medical-booking workflows across three commercial CLMs, \textsc{PrivScope} eliminates profile leakage (0.0\% vs.\ 17.7\%), more than halves attacker re-identification (23.1\% vs.\ 64.3\%), and achieves the highest candidate recall on every CLM tested while preserving task success close to the unprotected baseline on GPT-4o-mini and Gemini 2.5 Flash. Gains hold across five local backbones and add only seconds of on-device latency on commodity hardware.

翻译：[translated abstract in Chinese] 混合本地-云端智能体在将能力密集型子任务委托给云端语言模型（CLM）之前，会利用持久工作状态中的上下文信息对用户请求进行丰富。虽然这种丰富化能提升任务成功率，但也会在发往云端的负载中暴露不必要的信息，包括与任务无关的上下文、先前工作流遗留内容以及过度具体的敏感细节，从而导致信息过曝。现有解决方案要么隔离工作流以限制跨工作流泄漏，要么应用不针对本地组装的负载范围进行推理的通用净化处理。本文提出PrivScope，一种受信任的设备端负载管控器，它在本地-CLM边界上强制执行基于任务范围的信息披露，无需云端侧做出任何更改。其核心思想是：敏感信息仅在委托子任务必需时才能到达云端，并且必须采用既能保留效用又尽可能少暴露的形式。PrivScope从组装好的负载中提取信息披露单元，将直接标识符和账户关联值保留在设备端。剩余单元通过云端必要性控制，确定实际所需信息；必须发送至云端的单元会被抽象为足以完成任务的最高层级表示。在三种商用CLM上对100个医疗预约工作流进行的测试表明，PrivScope将配置文件泄漏率降至0.0%（对比基线17.7%），将攻击者重识别率降低过半（23.1%对比64.3%），并且在所有测试CLM上均取得最高候选召回率，同时在GPT-4o-mini和Gemini 2.5 Flash上保持了与无保护基线相近的任务成功率。上述增益在五种本地骨干模型上均保持稳定，且仅在商用硬件上增加数秒设备端延迟。