The recent success of Large Language Models (LLMs) signifies an impressive stride towards artificial general intelligence. They have shown a promising prospect in automatically completing tasks upon user instructions, functioning as brain-like coordinators. The associated risks will be revealed as we delegate an increasing number of tasks to machines for automated completion. A big question emerges: how can we make machines behave responsibly when helping humans automate tasks as personal copilots? In this paper, we explore this question in depth from the perspectives of feasibility, completeness and security. In specific, we present Responsible Task Automation (ResponsibleTA) as a fundamental framework to facilitate responsible collaboration between LLM-based coordinators and executors for task automation with three empowered capabilities: 1) predicting the feasibility of the commands for executors; 2) verifying the completeness of executors; 3) enhancing the security (e.g., the protection of users' privacy). We further propose and compare two paradigms for implementing the first two capabilities. One is to leverage the generic knowledge of LLMs themselves via prompt engineering while the other is to adopt domain-specific learnable models. Moreover, we introduce a local memory mechanism for achieving the third capability. We evaluate our proposed ResponsibleTA on UI task automation and hope it could bring more attentions to ensuring LLMs more responsible in diverse scenarios. The research project homepage is at https://task-automation-research.github.io/responsible_task_automation.
翻译:近期大型语言模型的成功标志着向通用人工智能迈出了令人瞩目的一步。这些模型在根据用户指令自动完成任务方面展现出广阔前景,能够充当类脑协调者。随着我们将越来越多的任务委托给机器自动完成,相关风险也将逐渐显现。一个重大疑问随之产生:当机器作为个人副手协助人类自动化任务时,我们如何确保其行为负责?本文从可行性、完整性和安全性三个维度深入探讨这一问题。具体而言,我们提出"负责任的任务自动化"(ResponsibleTA)作为基础框架,通过三种赋能能力促进基于LLM的协调者与执行者之间的负责任协作:1)预测指令对执行者的可行性;2)验证执行者的任务完整性;3)增强安全性(例如保护用户隐私)。我们进一步提出并比较了实现前两种能力的两种范式:一种是通过提示工程利用LLM自身的通用知识,另一种是采用领域特定的可学习模型。此外,我们引入本地内存机制来实现第三种能力。我们在UI任务自动化场景下评估了所提出的ResponsibleTA框架,并期望该工作能引发更多关注,确保LLM在多种场景下更加负责任。研究项目主页位于https://task-automation-research.github.io/responsible_task_automation。