Effective tutoring requires promptly providing instructional materials that match each student's needs, for example in response to a question. In this study, we introduce an agent that automatically delivers supplementary materials on demand during one-on-one tutoring sessions. The agent uses a multimodal large language model to analyze the spoken dialogue between instructor and student, generate search queries, and retrieve relevant Web images. In evaluation experiments, the agent reduced the average image retrieval time by 44.4 s relative to retrieval without support and provided images of acceptable quality in 85.7% of trials. These results indicate that the agent effectively supports instructors during tutoring sessions.
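The dialogue-to-query-to-image flow described above might be sketched roughly as follows. This is only an illustrative outline, not the implementation evaluated in the study: it assumes the spoken dialogue has already been transcribed to text, and the names `Utterance`, `format_dialogue`, `suggest_image`, `generate_query`, and `search_images` are hypothetical. The language model and the image search backend are passed in as plain callables, so no particular model or search API is implied.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Utterance:
    """One transcribed turn of the tutoring dialogue (hypothetical type)."""
    speaker: str  # e.g. "instructor" or "student"
    text: str


def format_dialogue(utterances: List[Utterance], max_turns: int = 6) -> str:
    """Render the most recent turns as plain text context for the model."""
    recent = utterances[-max_turns:]
    return "\n".join(f"{u.speaker}: {u.text}" for u in recent)


def suggest_image(
    utterances: List[Utterance],
    generate_query: Callable[[str], str],       # assumed wrapper around a language model
    search_images: Callable[[str], List[str]],  # assumed wrapper around an image search
    max_turns: int = 6,
) -> Optional[str]:
    """Return the URL of the top image for the current dialogue, if any."""
    context = format_dialogue(utterances, max_turns)
    query = generate_query(context).strip()
    if not query:
        # The model found nothing worth illustrating in the recent turns.
        return None
    results = search_images(query)
    return results[0] if results else None
```

With stub callables, for instance `suggest_image(dialogue, lambda ctx: "mitochondrion diagram", lambda q: ["https://example.com/img.png"])`, the function returns the first URL the search stub yields. Injecting the two callables keeps the sketch independent of any specific model or search service.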