Italian Crossword Generator: Enhancing Education through Interactive Word Puzzles

Educational crosswords offer numerous benefits for students, including increased engagement, improved understanding, critical thinking, and memory retention. Creating high-quality educational crosswords can be challenging, but recent advances in natural language processing and machine learning have made it possible to use language models to generate nice wordplays. The exploitation of cutting-edge language models like GPT3-DaVinci, GPT3-Curie, GPT3-Babbage, GPT3-Ada, and BERT-uncased has led to the development of a comprehensive system for generating and verifying crossword clues. A large dataset of clue-answer pairs was compiled to fine-tune the models in a supervised manner to generate original and challenging clues from a given keyword. On the other hand, for generating crossword clues from a given text, Zero/Few-shot learning techniques were used to extract clues from the input text, adding variety and creativity to the puzzles. We employed the fine-tuned model to generate data and labeled the acceptability of clue-answer parts with human supervision. To ensure quality, we developed a classifier by fine-tuning existing language models on the labeled dataset. Conversely, to assess the quality of clues generated from the given text using zero/few-shot learning, we employed a zero-shot learning approach to check the quality of generated clues. The results of the evaluation have been very promising, demonstrating the effectiveness of the approach in creating high-standard educational crosswords that offer students engaging and rewarding learning experiences.

翻译：教育类填字游戏为学生带来诸多益处，包括提高参与度、加深理解、培养批判性思维以及增强记忆力。创建高质量的教育填字游戏颇具挑战性，但自然语言处理与机器学习的最新进展使得利用语言模型生成优质文字游戏成为可能。通过采用GPT3-DaVinci、GPT3-Curie、GPT3-Babbage、GPT3-Ada及BERT-uncased等前沿语言模型，我们开发了一套用于生成和验证填字游戏线索的完整系统。为通过监督学习方式微调模型以从给定关键词生成原创且具有挑战性的线索，我们构建了包含大量线索-答案对的数据集。另一方面，针对从给定文本生成填字游戏线索的需求，我们采用零样本/少样本学习技术从输入文本中提取线索，从而增加谜题的多样性和创造性。我们利用微调模型生成数据，并通过人工监督对线索-答案部分的可接受性进行标注。为确保质量，我们在标注数据集上对现有语言模型进行微调，开发出分类器。反之，为评估通过零样本/少样本学习从给定文本生成的线索质量，我们采用零样本学习方法检验生成线索的质量。评估结果非常理想，证明了该方法在创建高水准教育填字游戏方面的有效性，能够为学生提供引人入胜且富有价值的学习体验。