Lately, Large Language Models have been widely used in code generation. GPT4 is considered the most potent Large Language Model from Openai. In this paper, we examine GPT3.5 and GPT4 as coding assistants. More specifically, we have constructed appropriate tests to check whether the two systems can a) answer typical questions that can arise during the code development, b) produce reliable code, and c) contribute to code debugging. The test results are impressive. The performance of GPT4 is outstanding and signals an increase in the productivity of programmers and the reorganization of software development procedures based on these new tools.
翻译:近年来,大语言模型已广泛应用于代码生成领域。GPT4被视为OpenAI开发的最强大的大语言模型。本文对GPT3.5和GPT4作为编程助手的能力进行了研究。具体而言,我们设计了专门测试来检验这两个系统是否能够:a) 回答代码开发过程中可能出现的典型问题,b) 生成可靠的代码,以及c) 辅助代码调试。测试结果令人瞩目。GPT4表现卓越,标志着程序员生产力的提升以及基于这些新工具的软件开发流程重组。