Lemur: Harmonizing Natural Language and Code for Language Agents

Yiheng Xu,Hongjin Su,Chen Xing,Boyu Mi,Qian Liu,Weijia Shi,Binyuan Hui,Fan Zhou,Yitao Liu,Tianbao Xie,Zhoujun Cheng,Siheng Zhao,Lingpeng Kong,Bailin Wang,Caiming Xiong,Tao Yu

from arxiv, ICLR 2024 Spotlight; https://github.com/OpenLemur/Lemur

We introduce Lemur and Lemur-Chat, openly accessible language models optimized for both natural language and coding capabilities to serve as the backbone of versatile language agents. The evolution from language chat models to functional language agents demands that models not only master human interaction, reasoning, and planning but also ensure grounding in the relevant environments. This calls for a harmonious blend of language and coding capabilities in the models. Lemur and Lemur-Chat are proposed to address this necessity, demonstrating balanced proficiencies in both domains, unlike existing open-source models that tend to specialize in either. Through meticulous pre-training using a code-intensive corpus and instruction fine-tuning on text and code data, our models achieve state-of-the-art averaged performance across diverse text and coding benchmarks among open-source models. Comprehensive experiments demonstrate Lemur's superiority over existing open-source models and its proficiency across various agent tasks involving human communication, tool usage, and interaction under fully- and partially- observable environments. The harmonization between natural and programming languages enables Lemur-Chat to significantly narrow the gap with proprietary models on agent abilities, providing key insights into developing advanced open-source agents adept at reasoning, planning, and operating seamlessly across environments. https://github.com/OpenLemur/Lemur

翻译：我们推出Lemur与Lemur-Chat，这是两款开源且专为兼具自然语言与代码能力而优化的语言模型，旨在作为多功能语言智能体的核心基础。从语言对话模型演进为功能性语言智能体，要求模型不仅需精通人际交互、推理与规划，还必须确保在相关环境中的基础适应性。这需要模型实现语言与编码能力的和谐融合。为应对这一需求，我们提出了Lemur与Lemur-Chat，其在两个领域均展现出均衡的熟练度，这与当前倾向于专精某一领域的现有开源模型形成鲜明对比。通过使用代码密集型语料库进行细致预训练，并结合文本与代码数据的指令微调，我们的模型在开源模型中，于多样化的文本与代码基准测试上取得了领先的平均性能。综合实验表明，Lemur优于现有开源模型，并在涉及人类交流、工具使用以及在全观测与部分可观测环境下交互的多种智能体任务中表现出色。自然语言与编程语言的协调使得Lemur-Chat在智能体能力上显著缩小了与专有模型的差距，为开发能够在不同环境中无缝进行推理、规划与操作的高级开源智能体提供了关键见解。https://github.com/OpenLemur/Lemur

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日