Towards an Understanding of Large Language Models in Software Engineering Tasks

Large Language Models (LLMs) have drawn widespread attention and research due to their astounding performance in text generation and reasoning tasks. Derivative products, like ChatGPT, have been extensively deployed and highly sought after. Meanwhile, the evaluation and optimization of LLMs in software engineering tasks, such as code generation, have become a research focus. However, there is still a lack of systematic research on applying and evaluating LLMs in software engineering. Therefore, this paper comprehensively investigate and collate the research and products combining LLMs with software engineering, aiming to answer two questions: (1) What are the current integrations of LLMs with software engineering? (2) Can LLMs effectively handle software engineering tasks? To find the answers, we have collected related literature as extensively as possible from seven mainstream databases and selected 123 timely papers published starting from 2022 for analysis. We have categorized these papers in detail and reviewed the current research status of LLMs from the perspective of seven major software engineering tasks, hoping this will help researchers better grasp the research trends and address the issues when applying LLMs. Meanwhile, we have also organized and presented papers with evaluation content to reveal the performance and effectiveness of LLMs in various software engineering tasks, guiding researchers and developers to optimize.

翻译：大语言模型（LLMs）因其在文本生成与推理任务中的惊人表现而受到广泛关注与研究。以ChatGPT为代表的衍生产品已被大规模部署并备受追捧。与此同时，LLMs在代码生成等软件工程任务中的评估与优化已成为研究热点。然而，目前关于LLMs在软件工程中应用与评估的系统性研究仍显不足。为此，本文全面调研与梳理了LLMs与软件工程相结合的研究与产品，旨在回答两个核心问题：（1）当前LLMs与软件工程有哪些结合方式？（2）LLMs能否有效处理软件工程任务？为寻求答案，我们从七个主流数据库中尽可能广泛地收集了相关文献，并筛选出123篇自2022年以来发表的时效性论文进行分析。我们对这些论文进行了细致分类，并从七大软件工程任务的视角综述了LLMs的研究现状，以帮助研究者更好地把握研究趋势并应对LLMs应用中的问题。同时，我们还整理并呈现了包含评估内容的论文，以揭示LLMs在不同软件工程任务中的表现与效能，从而指导研究者与开发者进行优化。

相关内容

Engineering

关注 7

《工程》是中国工程院（CAE）于2015年推出的国际开放存取期刊。其目的是提供一个高水平的平台，传播和分享工程研发的前沿进展、当前主要研究成果和关键成果；报告工程科学的进展，讨论工程发展的热点、兴趣领域、挑战和前景，在工程中考虑人与环境的福祉和伦理道德，鼓励具有深远经济和社会意义的工程突破和创新，使之达到国际先进水平，成为新的生产力，从而改变世界，造福人类，创造新的未来。期刊链接：https://www.sciencedirect.com/journal/engineering

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日