Using Large Language Models for Student-Code Guided Test Case Generation in Computer Science Education

In computer science education, test cases are an integral part of programming assignments since they can be used as assessment items to test students' programming knowledge and provide personalized feedback on student-written code. The goal of our work is to propose a fully automated approach for test case generation that can accurately measure student knowledge, which is important for two reasons. First, manually constructing test cases requires expert knowledge and is a labor-intensive process. Second, developing test cases for students, especially those who are novice programmers, is significantly different from those oriented toward professional-level software developers. Therefore, we need an automated process for test case generation to assess student knowledge and provide feedback. In this work, we propose a large language model-based approach to automatically generate test cases and show that they are good measures of student knowledge, using a publicly available dataset that contains student-written Java code. We also discuss future research directions centered on using test cases to help students.

翻译：在计算机科学教育中，测试用例是编程作业的重要组成部分，因为它们既可作为评估工具来检验学生的编程知识，又能为学生编写的代码提供个性化反馈。本研究旨在提出一种完全自动化的测试用例生成方法，能够准确衡量学生的知识掌握程度，这一目标具有双重重要意义。首先，手工构建测试用例既需要专业知识又耗费大量人力。其次，针对学生（尤其是编程新手）的测试用例开发与面向专业级软件开发人员的测试用例存在显著差异。因此，我们需要自动化的测试用例生成流程来评估学生知识并提供反馈。本研究提出基于大语言模型的方法自动生成测试用例，并通过包含学生编写的Java代码的公开数据集，证明这些测试用例能有效衡量学生的知识水平。最后，我们探讨了以测试用例辅助学生为核心的未来研究方向。

相关内容

CASES

关注 4

CASES：International Conference on Compilers, Architectures, and Synthesis for Embedded Systems。 Explanation：嵌入式系统编译器、体系结构和综合国际会议。 Publisher：ACM。 SIT： http://dblp.uni-trier.de/db/conf/cases/index.html

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日