Exploring the Potential of ChatGPT in Automated Code Refinement: An Empirical Study

Code review is an essential activity for ensuring the quality and maintainability of software projects. However, it is a time-consuming and often error-prone task that can significantly impact the development process. Recently, ChatGPT, a cutting-edge language model, has demonstrated impressive performance in various natural language processing tasks, suggesting its potential to automate code review. How well ChatGPT performs on code review tasks, however, remains unclear. To fill this gap, we conduct the first empirical study of ChatGPT's capabilities in code review, focusing specifically on automated code refinement based on given code reviews. For the study, we select the existing CodeReview benchmark and construct a new, high-quality code review dataset. We use CodeReviewer, a state-of-the-art code review tool, as the baseline for comparison with ChatGPT. Our results show that ChatGPT outperforms CodeReviewer in code refinement tasks: on the high-quality code review dataset, ChatGPT achieves EM (exact match) and BLEU scores of 22.78 and 76.44, respectively, whereas the state-of-the-art method achieves only 15.50 and 62.88. We further identify the root causes of ChatGPT's underperformance and propose several strategies to mitigate these challenges. Our study provides insights into the potential of ChatGPT for automating the code review process and highlights potential research directions.
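To make the two reported metrics concrete, the following is a minimal sketch of how exact match and a sentence-level BLEU could be computed for a refined code snippet against its ground truth. The abstract does not state the tokenizer or BLEU variant used in the study, so whitespace tokenization, add-one smoothing, and all function names here are assumptions for illustration only.

```python
import math
from collections import Counter


def tokenize(code: str) -> list[str]:
    """Whitespace tokenizer (an assumption; the study's tokenizer is not stated)."""
    return code.split()


def exact_match(prediction: str, reference: str) -> bool:
    """True when both snippets yield the same token sequence."""
    return tokenize(prediction) == tokenize(reference)


def ngram_counts(tokens: list[str], n: int) -> Counter:
    """Count every contiguous n-gram in the token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(prediction: str, reference: str, max_n: int = 4) -> float:
    """Sentence-level BLEU with add-one smoothing (a simplified variant)."""
    pred, ref = tokenize(prediction), tokenize(reference)
    if not pred:
        return 0.0
    log_precision_sum = 0.0
    for n in range(1, max_n + 1):
        pred_counts = ngram_counts(pred, n)
        ref_counts = ngram_counts(ref, n)
        # Clipped overlap: each predicted n-gram counts at most as often
        # as it appears in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in pred_counts.items())
        total = sum(pred_counts.values())
        log_precision_sum += math.log((overlap + 1) / (total + 1))
    # Brevity penalty discourages predictions shorter than the reference.
    if len(pred) >= len(ref):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - len(ref) / len(pred))
    return brevity_penalty * math.exp(log_precision_sum / max_n)


def corpus_scores(pairs: list[tuple[str, str]]) -> tuple[float, float]:
    """Average EM and BLEU over (prediction, reference) pairs, as percentages."""
    em = sum(exact_match(p, r) for p, r in pairs) / len(pairs)
    avg_bleu = sum(bleu(p, r) for p, r in pairs) / len(pairs)
    return 100 * em, 100 * avg_bleu
```

Under these assumptions, a refinement that reproduces the ground truth token for token scores 1 on both metrics, while a near miss scores 0 on EM but keeps a high BLEU. This is one way to read the gap between the reported EM and BLEU figures.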

