Language agents show potential in being capable of utilizing natural language for varied and intricate tasks in diverse environments, particularly when built upon large language models (LLMs). Current language agent frameworks aim to facilitate the construction of proof-of-concept language agents while neglecting the non-expert user access to agents and paying little attention to application-level designs. We present OpenAgents, an open platform for using and hosting language agents in the wild of everyday life. OpenAgents includes three agents: (1) Data Agent for data analysis with Python/SQL and data tools; (2) Plugins Agent with 200+ daily API tools; (3) Web Agent for autonomous web browsing. OpenAgents enables general users to interact with agent functionalities through a web user interface optimized for swift responses and common failures while offering developers and researchers a seamless deployment experience on local setups, providing a foundation for crafting innovative language agents and facilitating real-world evaluations. We elucidate the challenges and opportunities, aspiring to set a foundation for future research and development of real-world language agents.
翻译:语言代理展现出在多样化环境中利用自然语言完成复杂多变任务的潜力,尤其是在基于大语言模型(LLMs)构建时。当前语言代理框架主要致力于促进概念验证型语言代理的构建,却忽视了非专业用户对代理的可及性,且对应用层面的设计关注不足。我们提出OpenAgents——一个面向日常生活场景中语言代理使用与部署的开放平台。OpenAgents包含三种代理:(1) 数据代理:支持基于Python/SQL的数据分析及数据工具;(2) 插件代理:集成200余种日常API工具;(3) 网页代理:实现自主网络浏览。该平台通过针对快速响应与常见故障进行优化的Web用户界面,使普通用户能够与代理功能交互;同时为开发者和研究人员提供本地环境的无缝部署体验,为创新语言代理的构建及真实场景评估奠定基础。我们阐明了相关挑战与机遇,旨在为未来真实环境语言代理的研究与发展奠定基础。