Towards an Understanding of Large Language Models in Software Engineering Tasks

Large Language Models (LLMs) have drawn widespread attention and research due to their astounding performance in tasks such as text generation and reasoning. Derivative products, like ChatGPT, have been extensively deployed and highly sought after. Meanwhile, the evaluation and optimization of LLMs in software engineering tasks, such as code generation, have become a research focus. However, there is still a lack of systematic research on the application and evaluation of LLMs in the field of software engineering. Therefore, this paper is the first to comprehensively investigate and collate the research and products combining LLMs with software engineering, aiming to answer two questions: (1) What are the current integrations of LLMs with software engineering? (2) Can LLMs effectively handle software engineering tasks? To find the answers, we have collected related literature as extensively as possible from seven mainstream databases, and selected 123 papers for analysis. We have categorized these papers in detail and reviewed the current research status of LLMs from the perspective of seven major software engineering tasks, hoping this will help researchers better grasp the research trends and address the issues when applying LLMs. Meanwhile, we have also organized and presented papers with evaluation content to reveal the performance and effectiveness of LLMs in various software engineering tasks, providing guidance for researchers and developers to optimize.

翻译：大型语言模型（LLMs）因其在文本生成和推理等任务中的惊人表现而受到广泛关注和研究。诸如ChatGPT等衍生品已被广泛部署并备受追捧。同时，LLMs在代码生成等软件工程任务中的评估与优化已成为研究热点。然而，目前对于LLMs在软件工程领域的应用与评估仍缺乏系统性研究。因此，本文首次全面调查和整理了LLMs与软件工程相结合的研究及产品，旨在回答两个问题：（1）当前LLMs与软件工程的融合现状如何？（2）LLMs能否有效处理软件工程任务？为寻找答案，我们尽可能全面地收集了来自七个主流数据库的相关文献，并筛选出123篇论文进行分析。我们对这些论文进行了详细分类，并从七大类软件工程任务的角度审视了LLMs的研究现状，期望帮助研究者更好地把握研究趋势并解决应用LLMs时遇到的问题。同时，我们还整理并呈现了包含评估内容的论文，以揭示LLMs在各种软件工程任务中的性能与有效性，为研究人员和开发者的优化工作提供指导。

相关内容

Engineering

关注 7

《工程》是中国工程院（CAE）于2015年推出的国际开放存取期刊。其目的是提供一个高水平的平台，传播和分享工程研发的前沿进展、当前主要研究成果和关键成果；报告工程科学的进展，讨论工程发展的热点、兴趣领域、挑战和前景，在工程中考虑人与环境的福祉和伦理道德，鼓励具有深远经济和社会意义的工程突破和创新，使之达到国际先进水平，成为新的生产力，从而改变世界，造福人类，创造新的未来。期刊链接：https://www.sciencedirect.com/journal/engineering

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日