Artificial intelligence (AI) has developed rapidly, driven by advances in computational power and the growth of massive datasets. However, this progress has also heightened the challenge of interpreting the "black-box" nature of AI models. To address these concerns, eXplainable AI (XAI) has emerged, focusing on transparency and interpretability to enhance human understanding of and trust in AI decision-making processes. For multimodal data fusion and complex reasoning scenarios, Multimodal eXplainable AI (MXAI) has been proposed to integrate multiple modalities for both prediction and explanation tasks. Meanwhile, the advent of Large Language Models (LLMs) has led to remarkable breakthroughs in natural language processing, yet their complexity has further intensified the explainability challenges that MXAI must address. To provide key insights into the development of MXAI methods and crucial guidance for building more transparent, fair, and trustworthy AI systems, we review MXAI methods from a historical perspective and categorize them into four eras: traditional machine learning, deep learning, discriminative foundation models, and generative LLMs. We also review the evaluation metrics and datasets used in MXAI research, concluding with a discussion of future challenges and directions. A project related to this review is available at https://github.com/ShilinSun/mxai_review.