tap: A File-Based Protocol for Heterogeneous LLM Agent Collaboration

Existing multi-agent software development systems have proposed many forms of agent collaboration, including role-based collaboration and automated code review. However, many systems assume a common runtime, a central conversation server, or the same API family. Under these assumptions, LLM agents from different vendors cannot easily exchange messages directly from their own execution environments while dividing development and review work on a shared codebase. This paper presents tap, a file-based collaboration protocol that allows Claude (Anthropic) and Codex (OpenAI) to collaborate on one codebase without shared memory or an identical runtime. The core of tap is a file-first design that preserves markdown files with metadata as original messages, combines a file inspection path (file communication, Tier 1) with real-time notification paths for Claude and Codex (real-time communication, Tier 2), and isolates work through separate git worktrees. Even if real-time notification fails or a receiver restarts, the message file remains available and the same content can be inspected again. In a 27-day, 37-generation self-applied operation where tap was used to develop and review itself, we collected 209 tap-related pull requests and 717 operational artifacts. An analysis of 375 review artifacts showed that the share of reviews recording at least one defect or requested change was 69.8% for heterogeneous model pairs and 53.1% for homogeneous model pairs. These results show that tap, which combines file-based message preservation with real-time notification, operates in a real production repository, and that combining heterogeneous models and execution environments can broaden review perspectives. tap is distributed as the open-source npm package @hua-labs/tap (v0.5.2).

翻译：摘要：现有多智能体软件开发系统提出了多种智能体协作形式，包括基于角色的协作及自动化代码审查。然而，许多系统假设存在公共运行时、中央对话服务器或同一API族。在此假设下，来自不同供应商的大语言模型智能体无法在其各自执行环境中直接交换消息，同时在同一共享代码库上划分开发与审查工作。本文提出tap——一种基于文件的协作协议，允许Claude（Anthropic）与Codex（OpenAI）在无需共享内存或相同运行时的条件下协作于同一代码库。tap的核心是"文件优先"设计：将含元数据的Markdown文件作为原始消息保存，结合文件检查路径（文件通信，第1层）与面向Claude和Codex的实时通知路径（实时通信，第2层），并通过独立git工作树隔离任务。即使实时通知失败或接收端重启，消息文件仍可访问，同一内容亦可重新检查。在27天、37次自我迭代的应用中（tap被用于开发并审查自身），我们收集了209个与tap相关的拉取请求及717份操作工件。对375份审查工件的分析显示，记录至少一个缺陷或请求变更的审查比例在异构模型对中为69.8%，在同构模型对中为53.1%。结果表明，结合基于文件的消息保留与实时通知的tap能够在实际生产仓库中运行，且异构模型与执行环境的组合可拓展审查视角。tap以开源npm包@hua-labs/tap（v0.5.2）形式分发。

相关内容

TAP

关注 819

ACM应用感知TAP(ACM Transactions on Applied Perception)旨在通过发表有助于统一这些领域研究的高质量论文来增强计算机科学与心理学/感知之间的协同作用。该期刊发表跨学科研究，在跨计算机科学和感知心理学的任何主题领域都具有重大而持久的价值。所有论文都必须包含感知和计算机科学两个部分。主题包括但不限于：视觉感知：计算机图形学，科学/数据/信息可视化，数字成像，计算机视觉，立体和3D显示技术。听觉感知：听觉显示和界面，听觉听觉编码，空间声音，语音合成和识别。触觉：触觉渲染，触觉输入和感知。感觉运动知觉：手势输入，身体运动输入。感官感知：感官整合，多模式渲染和交互。官网地址：http://dblp.uni-trier.de/db/journals/tap/

超越个体智能：基于LLM的多智能体系统中的协作、故障归因与自演化综述

专知会员服务

20+阅读 · 5月16日

【ICML2026】MASPO：面向基于大语言模型的多智能体系统的联合提示词优化

专知会员服务

12+阅读 · 5月9日

多智能体协作机制

专知会员服务

23+阅读 · 4月25日

大语言模型智能体中的外显化机制：记忆、技能、协议与评测基准工程综述

专知会员服务

34+阅读 · 4月19日