Automated Test Case Repair Using Language Models

Ensuring the quality of software systems through testing is essential, yet maintaining test cases poses significant challenges and costs. The need for frequent updates to align with the evolving system under test often entails high complexity and cost for maintaining these test cases. Further, unrepaired broken test cases can degrade test suite quality and disrupt the software development process, wasting developers' time. To address this challenge, we present TaRGet (Test Repair GEneraTor), a novel approach leveraging pre-trained code language models for automated test case repair. TaRGet treats test repair as a language translation task, employing a two-step process to fine-tune a language model based on essential context data characterizing the test breakage. To evaluate our approach, we introduce TaRBench, a comprehensive benchmark we developed covering 45,373 broken test repairs across 59 open-source projects. Our results demonstrate TaRGet's effectiveness, achieving a 66.1% exact match accuracy. Furthermore, our study examines the effectiveness of TaRGet across different test repair scenarios. We provide a practical guide to predict situations where the generated test repairs might be less reliable. We also explore whether project-specific data is always necessary for fine-tuning and if our approach can be effective on new projects.

翻译：通过测试确保软件系统的质量至关重要，但维护测试用例却面临显著挑战和成本。测试用例需要频繁更新以适配被测系统的持续演进，这往往带来极高的复杂性和维护成本。此外，未修复的失效测试用例会降低测试套件质量，扰乱软件开发流程，并浪费开发人员时间。针对这一挑战，我们提出TaRGet（测试修复生成器）——一种利用预训练代码语言模型实现自动化测试用例修复的新方法。TaRGet将测试修复视为语言翻译任务，采用两步流程基于表征测试失效的关键上下文数据对语言模型进行微调。为评估该方法，我们构建了覆盖59个开源项目中45,373个失效测试修复的综合性基准测试集TaRBench。实验结果表明TaRGet具有显著效果，精确匹配准确率达到66.1%。我们进一步研究了TaRGet在不同测试修复场景下的有效性，提供实用指南用于预测生成修复可能不可靠的情形，同时探讨项目特定数据对微调的必要性以及该方法在新项目中的适用性。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日