A Framework for Measuring Appropriate Reliance on Set-Valued AI Advice

Appropriate reliance on AI advice has become a central research theme in human-AI collaboration. Existing frameworks have focused exclusively on point predictions as AI advice. However, set-valued AI advice (e.g., discrete sets or continuous intervals) is increasingly being used to communicate uncertainty and improve human decision making. In this paper, we develop the first formal framework for measuring appropriate reliance on set-valued AI advice within the sequential judge-advisor paradigm, spanning both classification and regression tasks. For classification, we first introduce the dimensions that are necessary for evaluating set-valued AI advice. We then define two metrics: correct reliance rate on AI and correct reliance rate on self, which jointly characterize appropriate reliance in this setting. For regression, we introduce quantity of AI reliance and quality of AI reliance, which respectively measure whether a decision maker utilized the AI advice and whether their reliance helped them get closer to the ground truth relative to their initial estimate. Through the application of our framework, we demonstrate how these metrics capture important nuances in human-AI collaboration that existing measures overlook.

翻译：对人工智能建议的恰当依赖已成为人机协作领域的核心研究主题。现有框架仅关注点预测型人工智能建议。然而，集合型人工智能建议（如离散集合或连续区间）正越来越多地被用于传递不确定性并改善人类决策。本文首次在序贯裁判-顾问范式下，构建了适用于分类与回归任务的集合型人工智能建议恰当依赖的正式衡量框架。针对分类任务，我们首先提出评估集合型人工智能建议的必要维度，进而定义两个指标：对人工智能的正确依赖率与对自身的正确依赖率，二者共同刻画该场景下的恰当依赖特征。针对回归任务，我们引入人工智能依赖数量与依赖质量两个概念，分别衡量决策者是否利用了人工智能建议，以及这种依赖是否帮助其初始估计更接近真实值。通过应用该框架，我们证明了这些指标能够捕捉现有度量方法所忽视的人机协作重要细微差异。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

《人工智能使能系统可靠性框架》

专知会员服务

20+阅读 · 4月27日

《人工智能辅助决策中信任的时间演化》225页

专知会员服务

25+阅读 · 2025年5月12日

《影响对人工智能决策支持系统依赖度的关键因素》304页

专知会员服务

29+阅读 · 2025年4月24日

《人类-人工智能握手框架：人与人工智能合作的双向方法》

专知会员服务

40+阅读 · 2025年2月5日