Fix the Mind, Not the Move: Interpretable AI Assistance via Knowledge-Gap Localization

AI assistants in human-AI collaboration often correct suboptimal human actions through behavioral feedback (e.g., alerts or steering-wheel nudges in assistive driving). Such interventions can mitigate immediate errors, but long-term improvement requires addressing the underlying misconceptions that cause repeated mistakes. We introduce SENSEI, a framework that infers user misconceptions from interaction behavior and provides targeted, minimal yet sufficient suggestions to correct them. Our approach departs from action- or trajectory-level interventions by operating over a structured knowledge representation to localize and correct the sources of erroneous behavior. Across three long-horizon tasks with diverse misconceptions and corresponding behaviors, SENSEI demonstrates zero-shot compositional generalization, disentangling multiple overlapping misconceptions despite training only on single-misconception cases. A user study further shows that our method identifies real human misconceptions and provides effective guidance that improves long-horizon task performance, successfully correcting $90\%$ of student misconceptions. Code and project page are available at https://misoshiruseijin.github.io/SENSEI/.

翻译：在人类与人工智能的协作中，AI助手通常通过行为反馈（例如辅助驾驶中的警报或方向盘提示）来纠正次优的人类行动。此类干预可缓解即时错误，但长期改进需要解决导致重复失误的根本性误解。我们提出SENSEI框架，该框架从交互行为推断用户误解，并提供针对性、最小且充分的建议以纠正这些误解。我们的方法超越了行动或轨迹层面的干预，通过结构化知识表征来定位并纠正错误行为的根源。在包含多样误解及相应行为的三个长周期任务中，SENSEI展现出零样本组合泛化能力，尽管仅基于单误解案例训练，仍能解耦多个重叠误解。用户研究进一步表明，我们的方法能识别真实的人类误解，并提供有效指导以提升长周期任务表现，成功纠正了90%的学生误解。代码及项目页面见 https://misoshiruseijin.github.io/SENSEI/。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

可解释人工智能（XAI）：从内在可解释性到大语言模型

专知会员服务

34+阅读 · 2025年1月20日

《人工智能辅助决策面临的三大挑战》最新33页

专知会员服务

53+阅读 · 2025年1月8日

《提高决策支持系统透明度的可解释人工智能》最新100页

专知会员服务

52+阅读 · 2024年11月28日

《人工智能辅助决策面临的三大挑战》

专知会员服务

86+阅读 · 2023年12月15日