ExplainReduce: Generating global explanations from many local explanations

Source: arXiv. 21 pages with a 36-page appendix, 8 + 39 figures, 1 + 1 tables. The datasets and source code used in the paper are available at https://github.com/edahelsinki/explainreduce. Accepted for publication in the 4th World Conference on eXplainable Artificial Intelligence (2026).

Most commonly used non-linear machine learning methods are closed-box models, uninterpretable to humans. The field of explainable artificial intelligence (XAI) aims to develop tools to examine the inner workings of these closed boxes. An often-used model-agnostic approach to XAI involves using simple models as local approximations to produce so-called local explanations; examples of this approach include LIME, SHAP, and SLISEMAP. This paper shows how a large set of local explanations can be reduced to a small "proxy set" of simple models, which can act as a generative global explanation. This reduction procedure, ExplainReduce, can be formulated as an optimisation problem and approximated efficiently using greedy heuristics. We show that, for many problems, as few as five explanations can faithfully emulate the closed-box model and that our reduction procedure is competitive with other model aggregation methods.
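To make the idea of a greedy reduction concrete, the following is a minimal sketch and not the paper's implementation. It assumes a precomputed loss matrix in which entry `loss[m][i]` is the error of local model `m` on data item `i`, and it assumes one plausible objective: choose `k` models so that the sum over items of the smallest loss among the chosen models is as low as possible. The function name `greedy_proxy_set`, the objective, and the toy numbers are all illustrative assumptions; the abstract does not specify the exact objectives or heuristics.

```python
def greedy_proxy_set(loss, k):
    """Greedily pick up to k local models (rows of `loss`).

    Assumed objective (hypothetical, for illustration only): minimise the
    sum over data items of the lowest loss achieved by any selected model.
    """
    n_models = len(loss)
    n_items = len(loss[0])
    selected = []
    # Best loss reached so far on each item; infinite before any model is chosen.
    best = [float("inf")] * n_items
    for _ in range(min(k, n_models)):
        best_total, best_model = float("inf"), None
        for m in range(n_models):
            if m in selected:
                continue
            # Objective value if model m were added to the current proxy set.
            total = sum(min(b, l) for b, l in zip(best, loss[m]))
            if total < best_total:
                best_total, best_model = total, m
        selected.append(best_model)
        best = [min(b, l) for b, l in zip(best, loss[best_model])]
    return selected, best


# Toy loss matrix (made-up numbers): 3 local models evaluated on 4 data items.
toy_loss = [
    [0.1, 0.2, 5.0, 6.0],  # model 0 fits items 0 and 1 well
    [4.0, 5.0, 0.1, 0.3],  # model 1 fits items 2 and 3 well
    [3.0, 3.0, 3.0, 3.0],  # model 2 is mediocre everywhere
]
selected, best = greedy_proxy_set(toy_loss, k=2)
print(selected, best)
```

In this toy case the two selected models together cover every item with low loss, which mirrors the claim that a handful of local explanations can stand in for the full set. A greedy choice is not guaranteed to be optimal in general; it trades exactness for speed.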
