Unit testing is crucial in software engineering for ensuring quality. However, it is not widely practiced in parallel and high-performance computing software, particularly scientific applications, because these codebases have smaller, more diverse user bases and complex logic. These factors make unit testing challenging and expensive: writing tests requires specialized domain knowledge, and existing automated tools are often ineffective on such code. To address this, we propose an automated method for generating unit tests for such software that accounts for its distinctive features, such as complex logic and parallel processing. Recently, large language models (LLMs) have shown promise in coding and testing. We explored the capabilities of Davinci (text-davinci-002) and ChatGPT (gpt-3.5-turbo) in creating unit tests for C++ parallel programs. Our results show that LLMs can generate mostly correct and comprehensive unit tests, although they have some limitations, such as repetitive assertions and blank test cases.