Autonomous agents powered by Large Language Models (LLMs) have demonstrated significant capabilities in virtual environments, yet their integration with the physical world remains largely confined to direct control interfaces. We present AgentRob, a framework that bridges online community forums, LLM-powered agents, and physical robots through the Model Context Protocol (MCP). AgentRob enables a paradigm in which autonomous agents participate in online forums: they read posts, extract natural language commands, dispatch physical robot actions, and report results back to the community. The system comprises three layers: a Forum Layer that provides asynchronous, persistent, multi-agent interaction; an Agent Layer whose forum agents poll for commands addressed to them by @mention; and a Robot Layer in which controllers driven by a vision-language model (VLM) translate commands into robot primitives via iterative tool calling and execute them on Unitree Go2/G1 hardware. The framework supports multiple concurrent agents, each with a distinct identity and physical embodiment, coexisting in the same forum, which establishes the feasibility of forum-mediated multi-agent robot orchestration.
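To make the poll, extract, dispatch, and report cycle concrete, the following is a minimal sketch of one forum agent's loop. It is not the AgentRob implementation: `InMemoryForum`, `StubRobot`, `ForumAgent`, and the regex-based command extraction are all hypothetical stand-ins. In particular, the in-memory list replaces a real forum reached through MCP, the regex replaces LLM-based command extraction, and the stub replaces the VLM-driven controller.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Post:
    post_id: int
    author: str
    body: str


@dataclass
class InMemoryForum:
    """Hypothetical stand-in for the Forum Layer: a persistent, append-only post list."""
    posts: list = field(default_factory=list)

    def publish(self, author: str, body: str) -> Post:
        post = Post(len(self.posts) + 1, author, body)
        self.posts.append(post)
        return post

    def posts_after(self, last_seen_id: int) -> list:
        # Return a new list so callers can publish while iterating over it.
        return [p for p in self.posts if p.post_id > last_seen_id]


class StubRobot:
    """Hypothetical stand-in for the Robot Layer: records commands, moves nothing."""

    def __init__(self):
        self.executed = []

    def execute(self, command: str) -> str:
        self.executed.append(command)
        return f"done: {command}"


class ForumAgent:
    """Hypothetical Agent Layer component: polls for posts that @mention it."""

    def __init__(self, name: str, forum: InMemoryForum, robot: StubRobot):
        self.name, self.forum, self.robot = name, forum, robot
        self.last_seen_id = 0
        # Assumption: the command is whatever text follows the @mention.
        self.mention = re.compile(rf"@{re.escape(name)}\b\s*(.*)", re.DOTALL)

    def poll_once(self) -> int:
        """Process unseen posts once; return the number of commands handled."""
        handled = 0
        for post in self.forum.posts_after(self.last_seen_id):
            self.last_seen_id = post.post_id
            if post.author == self.name:
                continue  # never react to our own reports
            match = self.mention.search(post.body)
            if not match or not match.group(1).strip():
                continue  # not addressed to this agent, or no command text
            result = self.robot.execute(match.group(1).strip())
            self.forum.publish(self.name, f"re #{post.post_id}: {result}")
            handled += 1
        return handled


# Two agents with distinct identities share one forum.
forum = InMemoryForum()
go2 = ForumAgent("go2_agent", forum, StubRobot())
g1 = ForumAgent("g1_agent", forum, StubRobot())
forum.publish("alice", "@go2_agent walk forward two meters")
forum.publish("bob", "nice weather today")
go2.poll_once()
g1.poll_once()
```

Because each agent tracks its own `last_seen_id` and reacts only to its own @mention, several agents can poll the same forum independently, which is the property the multi-agent claim above relies on. A real deployment would also need to persist the cursor across restarts and guard against executing the same command twice.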