Ceci n'est pas une explication: Evaluating Explanation Failures as Explainability Pitfalls in Language Learning Systems

AI-powered language learning tools increasingly provide instant, personalised feedback to millions of learners worldwide. However, this feedback can fail in ways that are difficult for learners--and even teachers--to detect, potentially reinforcing misconceptions and eroding learning outcomes over extended use. We present a portion of L2-Bench, a benchmark for evaluating AI systems in language education that includes (but is not limited to) six critical dimensions of effective feedback: diagnostic accuracy, awareness of appropriacy, causes of error, prioritisation, guidance for improvement, and supporting self-regulation. We analyse how AI systems can fail with respect to these dimensions. These failures, which we argue are conducive to "explainability pitfalls," are AI-generated explanations that appear helpful on the surface but are fundamentally flawed, increasing the risk of attainment, human-AI interaction, and socioaffective harms. We discuss how the specific context of language learning amplifies these risks and outline open questions we believe merit more attention when designing evaluation frameworks specifically. Our analysis aims to expand the community's understanding of both the typology of explainability pitfalls and the contextual dynamics in which they may occur in order to encourage AI developers to better design safe, trustworthy, and effective AI explanations.

翻译：AI驱动的语言学习工具正日益为数百万全球学习者提供即时的个性化反馈。然而，这类反馈可能以学习者甚至教师难以察觉的方式失效，长期使用中可能强化误解、侵蚀学习成效。我们提出L2-Bench基准的一部分，该基准用于评估语言教育中的AI系统，涵盖（但不限于）有效反馈的六个关键维度：诊断准确性、恰当性意识、错误根源、优先级排序、改进指导以及支持自我调节。我们分析了AI系统在这些维度上的失败方式。这些失败被我们论证为易于引发“可解释性陷阱”——即表面看似有益但实质上存在根本缺陷的AI生成解释，从而增加成就风险、人机交互风险及社会情感伤害风险。我们讨论了语言学习的具体情境如何放大这些风险，并概述了在设计评估框架时我们认为值得更多关注的开放性问题。本分析旨在拓展学界对可解释性陷阱类型学及其可能发生的语境动态的理解，以鼓励AI开发者设计更安全、可信且有效的AI解释。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

可解释人工智能（XAI）：从内在可解释性到大语言模型

专知会员服务

34+阅读 · 2025年1月20日

金融时间序列预测中的可解释人工智能（XAI）综述

专知会员服务

44+阅读 · 2024年7月25日

《可解释人工智能的态势感知框架 (SAFE-AI) 和 XAI 系统的人为因素考虑》麻省理工学院17页论文

专知会员服务

106+阅读 · 2023年2月19日

语音识别:不同深度学习方法的综述，Speech Recognition: a review of the different deep learning approaches

专知会员服务

33+阅读 · 2022年3月13日