Recent advances have enabled general computer-use agents that interpret screens and execute grounded actions from human instructions, yet they still struggle to generalize to unseen and evolving interfaces. While improving agent capability remains important, agent compatible interface design offers a complementary path by aligning interaction semantics with agent prior knowledge. In this paper, we revisit Nielsen 10 usability heuristics through the lens of computer-use agents, identifying which principles naturally transfer, where implicit design assumptions create agent specific failures, and how safe additive augmentations can improve robustness without harming human usability. To evaluate these ideas, we introduce UI-Verse, a suite of controlled environments built around functionally similar interfaces with different applied heuristics. Experiments show that our augmented heuristics consistently improve task completion and modestly improve efficiency, with combined heuristics yielding further gains. Human studies further show that these designs preserve the original interaction workflow without observable usability regressions. Overall, our findings highlight interface design as a practical complementary avenue for improving the reliability and generalization of computer use agents.
翻译:近期研究进展使得通用计算机使用代理得以实现,这类代理能够解析屏幕内容并根据人类指令执行基于环境的操作,但它们在泛化到未见过的动态界面时仍存在困难。尽管提升代理能力至关重要,但通过使交互语义与代理先验知识对齐,代理兼容性界面设计提供了一条互补路径。本文从计算机使用代理的视角重新审视了尼尔森十条可用性启发式原则,识别出哪些原则可自然迁移、隐性设计假设如何导致代理特定失效,以及安全增强性修改如何在保持人类可用性的同时提升稳健性。为验证这些假设,我们构建了UI-Verse环境套件,该套件围绕功能相似但应用不同启发式原则的界面设计而成。实验表明,增强后的启发式原则持续提升任务完成率并适度改善效率,组合使用多种原则可进一步获得增益。人类用户研究进一步证实,这些设计保留了原始交互流程,且未出现可观测的可用性退化。总体而言,我们的研究揭示了界面设计作为提升计算机使用代理可靠性与泛化能力的实用互补途径。