Exploring Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context Learning

Multiple-choice questions (MCQs) are ubiquitous in almost all levels of education since they are easy to administer, grade, and are a reliable format in both assessments and practices. An important aspect of MCQs is the distractors, i.e., incorrect options that are designed to target specific misconceptions or insufficient knowledge among students. To date, the task of crafting high-quality distractors has largely remained a labor-intensive process for teachers and learning content designers, which has limited scalability. In this work, we explore the task of automated distractor and corresponding feedback message generation in math MCQs using large language models. We establish a formulation of these two tasks and propose a simple, in-context learning-based solution. Moreover, we explore using two non-standard metrics to evaluate the quality of the generated distractors and feedback messages. We conduct extensive experiments on these tasks using a real-world MCQ dataset that contains student response information. Our findings suggest that there is a lot of room for improvement in automated distractor and feedback generation. We also outline several directions for future work

翻译：多项选择题（MCQs）在各级教育中普遍存在，因其易于实施、评分便捷，且是评估与练习中可靠的题型。选择题的关键要素在于干扰项——即旨在针对学生特定误解或知识不足而设计的错误选项。迄今为止，设计高质量干扰项对教师和教学内容设计者而言仍是一项劳动密集型工作，这限制了其可扩展性。本研究探索利用大型语言模型自动生成数学选择题的干扰项及相应反馈信息。我们建立了这两项任务的公式化定义，并提出了一种基于上下文学习的简单解决方案。此外，我们尝试使用两种非标准指标来评估所生成干扰项与反馈信息的质量。我们利用包含学生作答信息的真实选择题数据集进行了广泛实验。研究结果表明，自动干扰项与反馈生成仍有较大改进空间，并指出了未来工作的若干方向。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【WSDM2020】超越统计关系：将知识关系整合到多标签音乐风格分类的风格关联中（附pdf）

专知会员服务

18+阅读 · 2019年11月23日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日