Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning

Intelligent Tutoring Systems (ITS) enhance personalized learning by predicting student answers to provide immediate and customized instruction. However, recent research has primarily focused on the correctness of the answer rather than the student's performance on specific answer choices, limiting insights into students' thought processes and potential misconceptions. To address this gap, we present MCQStudentBert, an answer forecasting model that leverages the capabilities of Large Language Models (LLMs) to integrate contextual understanding of students' answering history along with the text of the questions and answers. By predicting the specific answer choices students are likely to make, practitioners can easily extend the model to new answer choices or remove answer choices for the same multiple-choice question (MCQ) without retraining the model. In particular, we compare MLP, LSTM, BERT, and Mistral 7B architectures to generate embeddings from students' past interactions, which are then incorporated into a finetuned BERT's answer-forecasting mechanism. We apply our pipeline to a dataset of language learning MCQ, gathered from an ITS with over 10,000 students to explore the predictive accuracy of MCQStudentBert, which incorporates student interaction patterns, in comparison to correct answer prediction and traditional mastery-learning feature-based approaches. This work opens the door to more personalized content, modularization, and granular support.

翻译：智能辅导系统（ITS）通过预测学生答案来提供即时个性化指导，从而增强个性化学习。然而，现有研究主要关注答案的正确性，而非学生在具体选项上的表现，这限制了对学生思维过程及潜在误解的深入洞察。为填补这一空白，我们提出了MCQStudentBert——一种答案预测模型，该模型利用大语言模型（LLMs）的能力，整合对学生答题历史的上下文理解以及问题与选项的文本信息。通过预测学生可能选择的特定答案选项，实践者能够轻松地将模型扩展至新的选项，或为同一多项选择题（MCQ）移除某些选项，而无需重新训练模型。具体而言，我们比较了MLP、LSTM、BERT和Mistral 7B架构，以从学生历史交互中生成嵌入表示，并将其整合到经过微调的BERT答案预测机制中。我们将该流程应用于一个语言学习MCQ数据集（采集自拥有超过10,000名学生的ITS），以探究融合了学生交互模式的MCQStudentBert的预测准确性，并与正确答案预测及传统的基于掌握学习特征的方法进行比较。此项工作为更个性化的内容、模块化设计以及精细化支持开辟了新的途径。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/