As the field of explainable AI (XAI) is maturing, calls for interactive explanations for (the outputs of) AI models are growing, but the state-of-the-art predominantly focuses on static explanations. In this paper, we focus instead on interactive explanations framed as conflict resolution between agents (i.e. AI models and/or humans) by leveraging on computational argumentation. Specifically, we define Argumentative eXchanges (AXs) for dynamically sharing, in multi-agent systems, information harboured in individual agents' quantitative bipolar argumentation frameworks towards resolving conflicts amongst the agents. We then deploy AXs in the XAI setting in which a machine and a human interact about the machine's predictions. We identify and assess several theoretical properties characterising AXs that are suitable for XAI. Finally, we instantiate AXs for XAI by defining various agent behaviours, e.g. capturing counterfactual patterns of reasoning in machines and highlighting the effects of cognitive biases in humans. We show experimentally (in a simulated environment) the comparative advantages of these behaviours in terms of conflict resolution, and show that the strongest argument may not always be the most effective.
翻译:随着可解释人工智能(XAI)领域的日趋成熟,针对AI模型(输出结果)的交互式解释需求日益增长,但当前主流方法仍主要聚焦于静态解释。本文转而关注基于计算论辩的智能体(即AI模型和/或人类)间冲突消解框架下的交互式解释。具体而言,我们定义了论辩交换(AXs),用于在多智能体系统中动态共享各智能体定量双极论辩框架中的信息,以消解智能体间的冲突。随后,我们将AXs应用于XAI场景,使机器与人类围绕机器预测结果进行交互。我们识别并评估了适用于XAI的AXs所具备的若干理论特性。最终,通过定义多种智能体行为(例如,捕捉机器中的反事实推理模式、突出人类认知偏差的影响),将XAI领域的AXs具体化。基于模拟环境的实验表明,这些行为在冲突消解方面具有比较优势,并证明最强论点并非总是最有效的。