Improving Performance of Commercially Available AI Products in a Multi-Agent Configuration

In recent years, with the rapid advancement of large language models (LLMs), multi-agent systems have become increasingly more capable of practical application. At the same time, the software development industry has had a number of new AI-powered tools developed that improve the software development lifecycle (SDLC). Academically, much attention has been paid to the role of multi-agent systems to the SDLC. And, while single-agent systems have frequently been examined in real-world applications, we have seen comparatively few real-world examples of publicly available commercial tools working together in a multi-agent system with measurable improvements. In this experiment we test context sharing between Crowdbotics PRD AI, a tool for generating software requirements using AI, and GitHub Copilot, an AI pair-programming tool. By sharing business requirements from PRD AI, we improve the code suggestion capabilities of GitHub Copilot by 13.8% and developer task success rate by 24.5% -- demonstrating a real-world example of commercially-available AI systems working together with improved outcomes.

翻译：近年来，随着大语言模型（LLMs）的快速发展，多智能体系统在实际应用中的能力日益增强。与此同时，软件开发行业已涌现出众多新型AI驱动工具，旨在改善软件开发生命周期（SDLC）。学术界对多智能体系统在SDLC中的作用给予了大量关注。然而，尽管单智能体系统在现实应用中的研究已较为常见，但公开可用的商业工具在多智能体系统中协同工作并带来可量化性能提升的实际案例相对较少。本实验测试了Crowdbotics PRD AI（一款利用AI生成软件需求的工具）与GitHub Copilot（一款AI结对编程工具）之间的上下文共享。通过共享PRD AI生成的业务需求，我们将GitHub Copilot的代码建议能力提升了13.8%，并将开发者的任务成功率提高了24.5%——这为商用AI系统协同工作并实现改进成果提供了一个现实案例。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日