Equilibrium Computation in Extensive-Form Games with Stochastic Action Sets

Extensive-form games (EFGs) are a standard model for sequential decision-making in games. A fundamental and typically implicit assumption in EFGs is that players always have access to all of their actions at every decision point. However, in many realistic settings, certain actions might be unavailable during game-play due to exogenous stochasticity, hindering the expressivity of the standard EFG model. Given a `base' EFG, we formalize a model that allows for actions to be stochastically restricted, leading to a corresponding Extensive-Form Games with Stochastic Action Sets (EFGSAS). In EFGSAS, we derive an expansion procedure that results in an equivalent EFG, thus showing that standard strategy formalisms could require exponentially-large representations. However, under an appropriate independence assumption, we show that compact strategy representations polynomial in the size of the base EFG exist. Computationally, we introduce an algorithm called SI-CFR that minimizes sleeping internal regret, converging to Nash equilibria with high probability in two-player zero-sum EFGSAS. Finally, we utilize a stochastic approximation procedure to recover compact representations of Nash equilibria, utilizing only the iterates of SI-CFR.

翻译：扩展式博弈（Extensive-Form Games，EFGs）是建模博弈中序贯决策的标准模型。该模型通常隐含一个基本假设：玩家在每个决策点始终能够使用所有可用动作。然而，在众多现实场景中，某些动作可能在博弈过程中因外生随机性而不可用，从而限制了标准EFG模型的表达能力。基于一个“基础”EFG，我们形式化了一个允许动作受到随机限制的模型，由此得到对应的具有随机动作集的扩展式博弈（Extensive-Form Games with Stochastic Action Sets，EFGSAS）。在EFGSAS中，我们推导出一种展开过程，该过程可得到一个等价的EFG，从而表明标准策略形式化表示可能需要指数级大小的表示。然而，在适当的独立性假设下，我们证明存在以基础EFG规模为多项式的紧凑策略表示。在计算方面，我们提出一种称为SI-CFR的算法，它能最小化睡眠内部遗憾，并在两人零和EFGSAS中以高概率收敛到纳什均衡。最后，我们利用一种随机逼近过程，仅通过SI-CFR的迭代结果来恢复纳什均衡的紧凑表示。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

随机网络效用最大化在战略排队系统中的博弈论方法

专知会员服务

11+阅读 · 4月13日

《动态作战规划：军事战役的随机博弈方法》2024最新37页论文

专知会员服务

142+阅读 · 2024年3月16日

【2023新书】合作博弈论的计算方面，170页pdf

专知会员服务

72+阅读 · 2023年6月29日

【CMU博士论文】不完全信息博弈中的博弈决策学习动力学、均衡计算和复杂性，358页pdf

专知会员服务

64+阅读 · 2023年6月16日