L2MAC: Large Language Model Automatic Computer for Unbounded Code Generation

Transformer-based large language models (LLMs) are constrained by the fixed context window of the underlying transformer architecture, hindering their ability to produce long and logically consistent code. Memory-augmented LLMs are a promising solution, but current approaches cannot handle long code generation tasks since they (1) only focus on reading memory and reduce its evolution to the concatenation of new memories or (2) use very specialized memories that cannot adapt to other domains. This paper presents L2MAC, the first practical LLM-based stored-program automatic computer for long and consistent code generation. Its memory has two components: the instruction registry, which is populated with a prompt program to solve the user-given task, and a file store, which will contain the final and intermediate outputs. Each instruction is executed by a separate LLM instance, whose context is managed by a control unit capable of precise memory reading and writing to ensure effective interaction with the file store. These components enable L2MAC to generate virtually unbounded code structures, bypassing the constraints of the finite context window while producing code that fulfills complex user-specified requirements. We empirically show that L2MAC succeeds in generating large code bases for system design tasks where other coding methods fall short in implementing user requirements and provide insight into the reasons for this performance gap.

翻译：基于Transformer的大型语言模型（LLMs）受限于底层Transformer架构的固定上下文窗口，难以生成逻辑一致的长代码。记忆增强型语言模型是一种有前景的解决方案，但现有方法无法处理长代码生成任务，原因在于：（1）它们仅关注记忆读取，将记忆演化简化为新记忆的拼接；或（2）使用高度专门化的记忆，难以适应其他领域。本文提出L2MAC，这是首个基于LLM的实用存储程序自动计算机，专为生成逻辑一致的长代码而设计。其记忆包含两个组件：指令寄存器，用于存储解决用户给定任务的提示程序；文件存储，用于保存最终和中间输出。每条指令由独立的LLM实例执行，其上下文由控制单元管理，该控制单元能够精确读写记忆，确保与文件存储的有效交互。这些组件使L2MAC能够生成几乎无界的代码结构，突破有限上下文窗口的限制，同时生成满足复杂用户需求的代码。实验结果表明，L2MAC能成功生成系统设计任务的大型代码库，而其他编码方法在此类任务中难以实现用户需求，并深入分析了导致这一性能差异的原因。

相关内容

大语言模型

关注 67

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日