Distilling ChatGPT for Explainable Automated Student Answer Assessment

Providing explainable and faithful feedback is crucial for automated student answer assessment. In this paper, we introduce a novel framework that explores using ChatGPT, a cutting-edge large language model, for the concurrent tasks of student answer scoring and rationale generation. We identify the appropriate instructions by prompting ChatGPT with different templates to collect the rationales, where inconsistent rationales are refined to align with marking standards. The refined ChatGPT outputs enable us to fine-tune a smaller language model that simultaneously assesses student answers and provides rationales. Extensive experiments on the benchmark dataset show that the proposed method improves the overall QWK score by 11% compared to ChatGPT. Furthermore, our thorough analysis and human evaluation demonstrate that the rationales generated by our proposed method are comparable to those of ChatGPT. Our approach provides a viable solution to achieve explainable automated assessment in education. Code available at https://github.com/lijiazheng99/aera.

翻译：提供可解释且忠实的反馈对于自动学生答案评估至关重要。本文提出了一种新颖框架，探索利用ChatGPT这一前沿大语言模型，同时完成学生答案评分与理由生成任务。我们通过不同提示模板引导ChatGPT生成理由，并识别出适当的指令集；针对不一致的理由进行精炼以符合评分标准。精炼后的ChatGPT输出可用于微调更小型的语言模型，使其能够同时评估学生答案并提供理由。在基准数据集上的广泛实验表明，相比ChatGPT，所提方法将整体QWK分数提升了11%。此外，我们的深入分析与人工评估证明，该方法生成的解释质量与ChatGPT相当。本研究为实现教育领域可解释的自动评估提供了可行方案。代码详见https://github.com/lijiazheng99/aera。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日