GUI agents are rapidly becoming a new interaction to software, allowing people to navigate web, desktop and mobile rather than execute them click by click. Yet ``agent'' is described with radically different degrees of autonomy, obscuring capability, responsibility and risk. We call for conceptual clarity through GUI Agent Autonomy Levels (GAL), a six-level framework that makes autonomy explicit and helps benchmark progress toward trustworthy software interaction.
翻译:GUI智能体正迅速成为软件交互的新范式,使用户能够以超越逐次点击的方式操作网页、桌面及移动应用。然而当前对"智能体"自主性的描述存在巨大差异,导致其能力边界、责任归属与潜在风险模糊不清。本文提出通过GUI智能体自主性分级框架(GAL)建立概念明晰度,该六级框架明确界定自主性层级,为构建可信赖的软件交互系统提供基准评估体系。