The emergence of large language models (LLMs) is expected to have a major impact on education. This paper explores the potential of using ChatGPT, an LLM, as a virtual teaching assistant (TA) in an introductory programming course. We evaluate ChatGPT's capabilities by comparing its performance with that of human TAs on important TA functions, focusing on (1) grading student code submissions and (2) providing feedback to undergraduate students. First, we assess ChatGPT's proficiency in grading student code submissions against a given grading rubric and compare its grades with those assigned by human TAs. Second, we analyze the quality and relevance of the feedback provided by ChatGPT, considering how well it addresses mistakes and suggests improvements in student solutions from both code-correctness and code-quality perspectives. We conclude with a discussion of the implications of integrating ChatGPT into computing education for automated grading, personalized learning experiences, and instructional support.