A Survey on Large Language Models for Software Engineering

Software Engineering (SE) is the systematic design, development, and maintenance of software applications, underpinning the digital infrastructure of our modern world. Very recently, the SE community has seen a rapidly increasing number of techniques employing Large Language Models (LLMs) to automate a broad range of SE tasks. Nevertheless, the applications, effects, and possible limitations of LLMs within SE are still not well studied. In this paper, we provide a systematic survey to summarize the current state-of-the-art research in the LLM-based SE community. We summarize 30 representative LLMs of source code across three model architectures, 15 pre-training objectives across four categories, and 16 downstream tasks across five categories. We then present a detailed summary of the recent SE studies in which LLMs are commonly utilized, including 155 studies for 43 specific code-related tasks across four crucial phases within the SE workflow. In addition, we summarize existing attempts to empirically evaluate LLMs in SE, such as benchmarks, empirical studies, and exploration of SE education. We also discuss several critical aspects of the optimization and application of LLMs in SE, such as security attacks, model tuning, and model compression. Finally, we highlight several challenges and potential opportunities in applying LLMs to future SE studies, such as exploring domain LLMs and constructing clean evaluation datasets. Overall, our work can help researchers gain a comprehensive understanding of the achievements of existing LLM-based SE studies and promote the practical application of these techniques. Our artifacts are publicly available and will be continuously updated in the living repository: https://github.com/iSEngLab/AwesomeLLM4SE.
