We often see the term explainable in the titles of papers that describe applications based on artificial intelligence (AI). However, the literature in explainable artificial intelligence (XAI) indicates that explanations in XAI are application- and domain-specific, hence requiring evaluation whenever they are employed to explain a model that makes decisions for a specific application problem. Additionally, the literature reveals that the performance of post-hoc methods, particularly feature attribution methods, varies substantially hinting that they do not represent a solution to AI explainability. Therefore, when using XAI methods, the quality and suitability of their information outputs should be evaluated within the specific application. For these reasons, we used a scoping review methodology to investigate papers that apply AI models and adopt methods to generate post-hoc explanations while referring to said models as explainable. This paper investigates whether the term explainable model is adopted by authors under the assumption that incorporating a post-hoc XAI method suffices to characterize a model as explainable. To inspect this problem, our review analyzes whether these papers conducted evaluations. We found that 81% of the application papers that refer to their approaches as an explainable model do not conduct any form of evaluation on the XAI method they used.
翻译:我们常在描述基于人工智能(AI)应用的论文标题中见到“可解释”一词。然而,可解释人工智能(XAI)领域的文献表明,XAI中的解释具有应用特定性与领域特定性,因此每当将其用于解释针对特定应用问题做出决策的模型时,都需要进行评估。此外,文献揭示,事后方法(尤其是特征归因方法)的性能差异显著,暗示其并非AI可解释性的解决方案。因此,在使用XAI方法时,需在具体应用场景中评估其信息输出的质量与适用性。基于这些原因,我们采用范围综述方法,调查了那些应用AI模型并采用事后解释方法生成解释,同时将所述模型称为“可解释”的论文。本文探究作者是否基于“引入事后XAI方法足以将模型定性为可解释”的假设而采用“可解释模型”这一术语。为检验该问题,我们的综述分析了这些论文是否进行了评估。结果发现,81%的将其方法称为“可解释模型”的应用论文并未对其使用的XAI方法进行任何形式的评估。