Additively Competitive Secretaries


Mohammad Mahdian, Jieming Mao, Enze Sun, Kangning Wang, Yifan Wang

In the secretary problem, a set of secretary candidates arrive in a uniformly random order and reveal their values one by one. A company, which can hire only one candidate and hopes to maximize the expected value of its hire, needs to make irrevocable online decisions about whether to hire the current candidate. The classical framework for evaluating a policy is to compute its worst-case competitive ratio against the optimal solution in hindsight, and there the best policy, the ``$1/e$ law'', has a competitive ratio of $1/e$. We propose an alternative evaluation framework through the lens of regret: the worst-case additive difference between the optimal hindsight solution and the expected performance of the policy, assuming that each value is normalized between $0$ and $1$. The $1/e$ law for the classical framework has a regret of $1 - 1/e \approx 0.632$; by contrast, we show that the class of ``pricing curves'' algorithms can guarantee a regret of at most $1/4 = 0.25$ (which is tight within the class), and the class of ``best-only pricing curves'' algorithms can guarantee a regret of at most $0.190$ (with a lower bound of $0.171$). In addition, we show that in general, no policy can give a regret guarantee better than $0.152$. Finally, we discuss other objectives in our regret-minimization framework, such as selecting the top-$k$ candidates for $k > 1$, or maximizing revenue during the selection process.
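
As an illustration of the regret measure described in the abstract, the sketch below estimates the regret of the classical $1/e$ rule on one hard instance: a single candidate of value 1 and all other candidates with distinct but negligible values. The instance, the function names, and the convention that the company hires nobody (value 0) when no later candidate beats the observation phase are assumptions made here for illustration; they are not taken from the paper. On this instance the rule earns roughly the probability of hiring the best candidate, so its regret is close to $1 - 1/e$.

```python
import math
import random


def classical_rule(values, cutoff):
    """Observe the first `cutoff` candidates, then hire the first later
    candidate who beats all of them; hire nobody (value 0) otherwise."""
    threshold = max(values[:cutoff], default=float("-inf"))
    for v in values[cutoff:]:
        if v > threshold:
            return v
    return 0.0  # assumed convention: no hire yields value 0


def prob_hire_best(n, cutoff):
    """Exact probability that the rule hires the best of n distinct values."""
    if cutoff == 0:
        return 1.0 / n  # the rule hires the first candidate
    return cutoff / n * sum(1.0 / i for i in range(cutoff, n))


def simulated_regret(n, trials=20000, seed=0):
    """Monte Carlo estimate of (hindsight optimum - expected hire)."""
    rng = random.Random(seed)
    cutoff = round(n / math.e)
    # One candidate worth 1; the rest are distinct but negligibly small.
    values = [1.0] + [1e-9 * k for k in range(1, n)]
    total = 0.0
    for _ in range(trials):
        rng.shuffle(values)
        total += classical_rule(values, cutoff)
    return max(values) - total / trials


if __name__ == "__main__":
    n = 100
    print("exact regret    :", 1 - prob_hire_best(n, round(n / math.e)))
    print("simulated regret:", simulated_regret(n))
    print("1 - 1/e         :", 1 - 1 / math.e)
```

The distinct small values matter: if the low-value candidates were all exactly 0 and the comparison were strict, the rule would hire the value-1 candidate whenever that candidate arrives after the observation phase, and the instance would no longer be hard.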

