While discounted payoff games and classic games that reduce to them, like parity and mean-payoff games, are symmetric, their solutions are not. We have taken a fresh view on the constraints that optimal solutions need to satisfy, and devised a novel way to converge to them, which is entirely symmetric. It also challenges the gospel that methods for solving payoff games are either based on strategy improvement or on value iteration.
翻译:尽管折扣收益博弈及其可归约的经典博弈(如奇偶博弈和平均收益博弈)具有对称性,但其解却并非对称。我们对最优解需满足的约束条件提出了全新视角,并设计了一种完全对称的收敛方法。这一方法同时挑战了“求解收益博弈的方法要么基于策略改进,要么基于值迭代”的既定观念。