In order to train children's ability to ask curiosity-driven questions, previous research has explored designing specific exercises relying on providing semantic and linguistic cues to help formulate such questions. But despite showing pedagogical efficiency, this method is still limited as it relies on generating the said cues by hand, which can be a very costly process. In this context, we propose to leverage advances in the natural language processing field (NLP) and investigate the efficiency of using a large language model (LLM) for automating the production of the pedagogical content of a curious question-asking (QA) training. We study generating the said content using the "prompt-based" method that consists of explaining the task to the LLM in natural text. We evaluate the output using human experts annotations and comparisons with hand-generated content. Results suggested indeed the relevance and usefulness of this content. We also conduct a field study in primary school (75 children aged 9-10), where we evaluate children's QA performance when having this training. We compare 3 types of content : 1) hand-generated content that proposes "closed" cues leading to predefined questions; 2) GPT-3-generated content that proposes the same type of cues; 3) GPT-3-generated content that proposes "open" cues leading to several possible questions. We see a similar QA performance between the two "closed" trainings (showing the scalability of the approach using GPT-3), and a better one for participants with the "open" training. These results suggest the efficiency of using LLMs to support children in generating more curious questions, using a natural language prompting approach that affords usability by teachers and other users not specialists of AI techniques. Furthermore, results also show that open-ended content may be more suitable for training curious question-asking skills.
翻译:为培养儿童提出好奇心驱动问题的能力,先前研究探索了通过提供语义和语言线索来设计特定练习,以帮助构建这类问题。尽管该方法展现出教学有效性,但仍存在局限性:人工生成上述线索成本高昂。在此背景下,我们提出利用自然语言处理领域的最新进展,探究使用大型语言模型自动化生成好奇心提问训练教学内容的效率。我们采用"提示式"方法生成教学内容,即通过自然语言向LLM解释任务。通过人工专家标注及与人工生成内容的对比评估输出结果,结果验证了该内容的相关性与实用性。我们还在小学进行实地研究(75名9-10岁儿童),评估儿童在接受训练后的提问能力表现。比较三种内容类型:1)人工生成的"封闭式"线索(引导至预设问题);2)GPT-3生成的同类"封闭式"线索;3)GPT-3生成的"开放式"线索(导向多种可能问题)。两种"封闭式"训练下儿童的提问能力表现相似(表明使用GPT-3方法具有可扩展性),而接受"开放式"训练的参与者在能力表现上更优。这些结果表明,采用自然语言提示方法(便于教师及非AI技术人员使用)借助LLM能有效支持儿童提出更具好奇心的问题。此外,研究还显示开放式内容可能更适用于训练好奇心提问技能。