Large Language Models (LLMs) are advanced Artificial Intelligence (AI) systems that have undergone extensive training using large datasets in order to understand and produce language that closely resembles that of humans. These models have reached a level of proficiency where they are capable of successfully completing university exams across several disciplines and generating functional code to handle novel problems. This research investigates the coding proficiency of ChatGPT 3.5, a LLM released by OpenAI in November 2022, which has gained significant recognition for its impressive text generating and code creation capabilities. The skill of the model in creating code snippets is evaluated across 10 various programming languages and 4 different software domains. Based on the findings derived from this research, major unexpected behaviors and limitations of the model have been identified. This study aims to identify potential areas for development and examine the ramifications of automated code generation on the evolution of programming languages and on the tech industry.
翻译:大型语言模型(LLMs)是先进的人工智能(AI)系统,它们通过大量数据集的广泛训练来理解和生成接近人类的语言。这些模型已达到熟练掌握的程度,能够成功完成多个学科领域的大学考试,并生成功能代码来处理新问题。本研究调查了OpenAI于2022年11月发布的ChatGPT 3.5的编码能力——该模型因其出色的文本生成和代码创建能力而获得广泛认可。我们评估了该模型在10种不同编程语言和4个不同软件领域中生成代码片段的能力。基于本研究得出的结果,已识别出该模型的主要意外行为和局限性。本研究旨在明确潜在的改进领域,并探讨自动代码生成对编程语言演进及科技行业的影响。