Decision-aid or Controller? Steering Human Decision Makers with Algorithms - 专知论文

会员服务 ·

0

算法 · 决策函数 · 均衡 · 控制器 · 映射 ·

2023 年 3 月 23 日

Decision-aid or Controller? Steering Human Decision Makers with Algorithms

翻译：决策辅助还是控制者？用算法引导人类决策者

Ruqing Xu,Sarah Dean

Algorithms are used to aid human decision makers by making predictions and recommending decisions. Currently, these algorithms are trained to optimize prediction accuracy. What if they were optimized to control final decisions? In this paper, we study a decision-aid algorithm that learns about the human decision maker and provides ''personalized recommendations'' to influence final decisions. We first consider fixed human decision functions which map observable features and the algorithm's recommendations to final decisions. We characterize the conditions under which perfect control over final decisions is attainable. Under fairly general assumptions, the parameters of the human decision function can be identified from past interactions between the algorithm and the human decision maker, even when the algorithm was constrained to make truthful recommendations. We then consider a decision maker who is aware of the algorithm's manipulation and responds strategically. By posing the setting as a variation of the cheap talk game [Crawford and Sobel, 1982], we show that all equilibria are partition equilibria where only coarse information is shared: the algorithm recommends an interval containing the ideal decision. We discuss the potential applications of such algorithms and their social implications.

翻译：算法通过作出预测和推荐决策来辅助人类决策者。当前，这些算法被训练以优化预测准确性。如果它们被优化以控制最终决策会怎样？本文研究了一种学习人类决策者特征并提供"个性化推荐"以影响最终决策的决策辅助算法。我们首先考虑固定的人类决策函数，该函数将可观测特征与算法推荐映射至最终决策。我们刻画了可实现最终决策完美控制的条件。在相当普遍的假设下，即使算法曾受限于作出真实推荐，人类决策函数的参数仍可从算法与人类决策者过往交互中识别。随后我们考虑意识到算法操纵并作出策略性回应的决策者。通过将该场景设定为廉价谈话博弈[Crawford and Sobel, 1982]的变体，我们证明所有均衡均为仅共享粗略信息的分区均衡：算法推荐包含理想决策的区间。我们讨论了此类算法的潜在应用及其社会影响。

0

相关内容

在数学和计算机科学之中，算法（Algorithm）为一个计算的具体步骤，常用于计算、数据处理和自动推理。精确而言，算法是一个表示为有限长列表的有效方法。算法应包含清晰定义的指令用于计算函数。来自维基百科：算法

Nat. Biotechnol. | 一个综合的SARS-CoV-2-human蛋白-蛋白相互作用组

Nat. Biotechnol. | 一个综合的SARS-CoV-2-human蛋白-蛋白相互作用组

专知会员服务

3+阅读 · 2022年10月31日

258页简单学算法！《grokking算法图解指南》，grokking algorithms: An illustrated guide for programmers and other curious people

258页简单学算法！《grokking算法图解指南》，grokking algorithms: An illustrated guide for programmers and other curious people

专知会员服务

44+阅读 · 2022年4月5日

【多目标多智能体系统决策】196页PDF布鲁塞尔自由大学博士论文，Decision Making in Multi-Objective Multi-Agent Systems——A Utility-Based Perspective

【多目标多智能体系统决策】196页PDF布鲁塞尔自由大学博士论文，Decision Making in Multi-Objective Multi-Agent Systems——A Utility-Based Perspective

专知会员服务

118+阅读 · 2022年3月18日

【干货书】强化学习算法，98页pdf综合讲解人工智能和机器学习

【干货书】强化学习算法，98页pdf综合讲解人工智能和机器学习

专知会员服务

66+阅读 · 2021年2月21日

【斯坦福新书】决策算法，464页pdf，Algorithms for Decision Making

【斯坦福新书】决策算法，464页pdf，Algorithms for Decision Making

专知会员服务

124+阅读 · 2020年12月7日

【KDD2020】具有条件公平性的算法决策，Algorithmic Decision Making with Conditional Fairness

【KDD2020】具有条件公平性的算法决策，Algorithmic Decision Making with Conditional Fairness

专知会员服务

22+阅读 · 2020年6月19日

【2020密歇根大学论文】基于学习的序列决策算法的公平性综述论文，Fairness in Learning-Based Sequential Decision Algorithms: A Survey

【2020密歇根大学论文】基于学习的序列决策算法的公平性综述论文，Fairness in Learning-Based Sequential Decision Algorithms: A Survey

专知会员服务

22+阅读 · 2020年1月15日

【Facebook|AAAI2020】在合作的部分可观察博弈中通过搜索改进策略（Improving Policies via Search in Cooperative Partially Observable Games）

【Facebook|AAAI2020】在合作的部分可观察博弈中通过搜索改进策略（Improving Policies via Search in Cooperative Partially Observable Games）

专知会员服务

16+阅读 · 2019年12月10日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

使用 TensorFlow Lite Searcher Library 实现设备端文本到图像搜索

使用 TensorFlow Lite Searcher Library 实现设备端文本到图像搜索

TensorFlow

0+阅读 · 2022年6月1日

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【泡泡一分钟】基于机器人的视觉惯性里程计（IROS2018-10）

【泡泡一分钟】基于机器人的视觉惯性里程计（IROS2018-10）

泡泡机器人SLAM

13+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新七篇视觉问答（VQA）相关论文—差别注意力机制、视觉问题推理、视觉对话、数据可视化、记忆增强网络、显式推理

【论文推荐】最新七篇视觉问答（VQA）相关论文—差别注意力机制、视觉问题推理、视觉对话、数据可视化、记忆增强网络、显式推理

专知

17+阅读 · 2018年4月19日

可解释的CNN

可解释的CNN

CreateAMind

18+阅读 · 2017年10月5日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

矮牵牛DUF620蛋白家族基因PhADR1的功能及调控机理解析

国家自然科学基金

0+阅读 · 2015年12月31日

救援物资库存系统的VMI策略研究

国家自然科学基金

1+阅读 · 2013年12月31日

网络环境下非线性时变随机系统的最优递推滤波研究

国家自然科学基金

0+阅读 · 2013年12月31日

低交叉极化共形天线阵列综合的混合DE算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

热电磁流动和热电磁力对液/固界面稳定性和枝晶生长的影响

国家自然科学基金

0+阅读 · 2012年12月31日

Riemann-Hilbert 方法和随机矩阵谱分析中的 Painleve 渐近

国家自然科学基金

0+阅读 · 2012年12月31日

基于Hamilton方法的电动汽车驱动系统能量动态优化和控制

国家自然科学基金

0+阅读 · 2009年12月31日

基因调控网络的鲁棒随机动力学分析与综合

国家自然科学基金

0+阅读 · 2008年12月31日

“#22810;级分区”#22478;市交通出行诱导系统规划及诱导策略研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于智能多模型粒子滤波的运动物体状态估计研究

国家自然科学基金

0+阅读 · 2008年12月31日

Prompt-Tuning Decision Transformer with Preference Ranking

Arxiv

0+阅读 · 2023年5月16日

Bi-Objective Lexicographic Optimization in Markov Decision Processes with Related Objectives

Arxiv

0+阅读 · 2023年5月16日

On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective

Arxiv

1+阅读 · 2023年5月16日

Centralized Model-Predictive Control with Human-Driver Interaction for Platooning

Arxiv

0+阅读 · 2023年5月16日

More Like Real World Game Challenge for Partially Observable Multi-Agent Cooperation

Arxiv

0+阅读 · 2023年5月15日

Theoretical Analyses of Evolutionary Algorithms on Time-Linkage OneMax with General Weights

Arxiv

0+阅读 · 2023年5月11日

A Survey of Decision Making in Adversarial Games

Arxiv

85+阅读 · 2022年7月16日

Dynamic neighbourhood optimisation for task allocation using multi-agent

Arxiv

102+阅读 · 2022年5月11日

The Conflict Between Explainable and Accountable Decision-Making Algorithms

Arxiv

31+阅读 · 2022年5月11日

Reasoning on Knowledge Graphs with Debate Dynamics

Reasoning on Knowledge Graphs with Debate Dynamics

Arxiv

14+阅读 · 2020年1月2日

VIP会员

文章信息

相关主题

最新内容

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

专知会员服务

3+阅读 · 6月18日

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

专知会员服务

4+阅读 · 6月18日

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

专知会员服务

9+阅读 · 6月18日

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

专知会员服务

8+阅读 · 6月18日

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

专知会员服务

5+阅读 · 6月17日

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

专知会员服务

7+阅读 · 6月17日

学习数据的几何：形状空间分析数学综述

学习数据的几何：形状空间分析数学综述

专知会员服务

6+阅读 · 6月17日

《现代防空系统综述：架构、传感器、拦截器及新兴威胁环境对基础设施受限防御环境的影响》2026最新长综述

《现代防空系统综述：架构、传感器、拦截器及新兴威胁环境对基础设施受限防御环境的影响》2026最新长综述

专知会员服务

10+阅读 · 6月17日

定向能反无人机系统最新发展动态

定向能反无人机系统最新发展动态

专知会员服务

7+阅读 · 6月17日

从燃煤战舰到算法战争：水面指挥的永恒要求

从燃煤战舰到算法战争：水面指挥的永恒要求

专知会员服务

4+阅读 · 6月17日

《短程弹道再入飞行器拦截时间中的一项异常现象》

《短程弹道再入飞行器拦截时间中的一项异常现象》

专知会员服务

6+阅读 · 6月17日

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

专知会员服务

7+阅读 · 6月17日

美智库《战术级指挥控制的迫切要求：构建弹性机动式指挥控制网络》报告

美智库《战术级指挥控制的迫切要求：构建弹性机动式指挥控制网络》报告

专知会员服务

6+阅读 · 6月17日

《韩国国防政策与军备出口：韩国安全与国防政策如何塑造其国防工业与军备出口格局》最新100页报告

《韩国国防政策与军备出口：韩国安全与国防政策如何塑造其国防工业与军备出口格局》最新100页报告

专知会员服务

5+阅读 · 6月17日

ICML 2026 | VOTP：用视频基础模型与最优传输，让离线偏好强化学习只需少量反馈

ICML 2026 | VOTP：用视频基础模型与最优传输，让离线偏好强化学习只需少量反馈

专知会员服务

6+阅读 · 6月16日

相关VIP内容

Nat. Biotechnol. | 一个综合的SARS-CoV-2-human蛋白-蛋白相互作用组

Nat. Biotechnol. | 一个综合的SARS-CoV-2-human蛋白-蛋白相互作用组

专知会员服务

3+阅读 · 2022年10月31日

258页简单学算法！《grokking算法图解指南》，grokking algorithms: An illustrated guide for programmers and other curious people

258页简单学算法！《grokking算法图解指南》，grokking algorithms: An illustrated guide for programmers and other curious people

专知会员服务

44+阅读 · 2022年4月5日

【多目标多智能体系统决策】196页PDF布鲁塞尔自由大学博士论文，Decision Making in Multi-Objective Multi-Agent Systems——A Utility-Based Perspective

【多目标多智能体系统决策】196页PDF布鲁塞尔自由大学博士论文，Decision Making in Multi-Objective Multi-Agent Systems——A Utility-Based Perspective

专知会员服务

118+阅读 · 2022年3月18日

【干货书】强化学习算法，98页pdf综合讲解人工智能和机器学习

【干货书】强化学习算法，98页pdf综合讲解人工智能和机器学习

专知会员服务

66+阅读 · 2021年2月21日

【斯坦福新书】决策算法，464页pdf，Algorithms for Decision Making

【斯坦福新书】决策算法，464页pdf，Algorithms for Decision Making

专知会员服务

124+阅读 · 2020年12月7日

【KDD2020】具有条件公平性的算法决策，Algorithmic Decision Making with Conditional Fairness

【KDD2020】具有条件公平性的算法决策，Algorithmic Decision Making with Conditional Fairness

专知会员服务

22+阅读 · 2020年6月19日

【2020密歇根大学论文】基于学习的序列决策算法的公平性综述论文，Fairness in Learning-Based Sequential Decision Algorithms: A Survey

【2020密歇根大学论文】基于学习的序列决策算法的公平性综述论文，Fairness in Learning-Based Sequential Decision Algorithms: A Survey

专知会员服务

22+阅读 · 2020年1月15日

【Facebook|AAAI2020】在合作的部分可观察博弈中通过搜索改进策略（Improving Policies via Search in Cooperative Partially Observable Games）

【Facebook|AAAI2020】在合作的部分可观察博弈中通过搜索改进策略（Improving Policies via Search in Cooperative Partially Observable Games）

专知会员服务

16+阅读 · 2019年12月10日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

相关资讯

使用 TensorFlow Lite Searcher Library 实现设备端文本到图像搜索

使用 TensorFlow Lite Searcher Library 实现设备端文本到图像搜索

TensorFlow

0+阅读 · 2022年6月1日

量化金融强化学习论文集合

量化金融强化学习论文集合

专知

14+阅读 · 2019年12月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【泡泡一分钟】基于机器人的视觉惯性里程计（IROS2018-10）

【泡泡一分钟】基于机器人的视觉惯性里程计（IROS2018-10）

泡泡机器人SLAM

13+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新七篇视觉问答（VQA）相关论文—差别注意力机制、视觉问题推理、视觉对话、数据可视化、记忆增强网络、显式推理

【论文推荐】最新七篇视觉问答（VQA）相关论文—差别注意力机制、视觉问题推理、视觉对话、数据可视化、记忆增强网络、显式推理

专知

17+阅读 · 2018年4月19日

可解释的CNN

可解释的CNN

CreateAMind

18+阅读 · 2017年10月5日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Prompt-Tuning Decision Transformer with Preference Ranking

Arxiv

0+阅读 · 2023年5月16日

Bi-Objective Lexicographic Optimization in Markov Decision Processes with Related Objectives

Arxiv

0+阅读 · 2023年5月16日

On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective

Arxiv

1+阅读 · 2023年5月16日

Centralized Model-Predictive Control with Human-Driver Interaction for Platooning

Arxiv

0+阅读 · 2023年5月16日

More Like Real World Game Challenge for Partially Observable Multi-Agent Cooperation

Arxiv

0+阅读 · 2023年5月15日

Theoretical Analyses of Evolutionary Algorithms on Time-Linkage OneMax with General Weights

Arxiv

0+阅读 · 2023年5月11日

A Survey of Decision Making in Adversarial Games

Arxiv

85+阅读 · 2022年7月16日

Dynamic neighbourhood optimisation for task allocation using multi-agent

Arxiv

102+阅读 · 2022年5月11日

The Conflict Between Explainable and Accountable Decision-Making Algorithms

Arxiv

31+阅读 · 2022年5月11日

Reasoning on Knowledge Graphs with Debate Dynamics

Reasoning on Knowledge Graphs with Debate Dynamics

Arxiv

14+阅读 · 2020年1月2日

相关基金

矮牵牛DUF620蛋白家族基因PhADR1的功能及调控机理解析

国家自然科学基金

0+阅读 · 2015年12月31日

救援物资库存系统的VMI策略研究

国家自然科学基金

1+阅读 · 2013年12月31日

网络环境下非线性时变随机系统的最优递推滤波研究

国家自然科学基金

0+阅读 · 2013年12月31日

低交叉极化共形天线阵列综合的混合DE算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

热电磁流动和热电磁力对液/固界面稳定性和枝晶生长的影响

国家自然科学基金

0+阅读 · 2012年12月31日

Riemann-Hilbert 方法和随机矩阵谱分析中的 Painleve 渐近

国家自然科学基金

0+阅读 · 2012年12月31日

基于Hamilton方法的电动汽车驱动系统能量动态优化和控制

国家自然科学基金

0+阅读 · 2009年12月31日

基因调控网络的鲁棒随机动力学分析与综合

国家自然科学基金

0+阅读 · 2008年12月31日

“#22810;级分区”#22478;市交通出行诱导系统规划及诱导策略研究

国家自然科学基金

0+阅读 · 2008年12月31日

基于智能多模型粒子滤波的运动物体状态估计研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员