实力评估与游戏中类人实力调整 (Strength Estimation and Human-Like Strength Adjustment in Games)

Strength estimation and adjustment are crucial in designing human-AI interactions, particularly in games where AI surpasses human players. This paper introduces a novel strength system, including a strength estimator (SE) and an SE-based Monte Carlo tree search, denoted as SE-MCTS, which predicts strengths from games and offers different playing strengths with human styles. The strength estimator calculates strength scores and predicts ranks from games without direct human interaction. SE-MCTS utilizes the strength scores in a Monte Carlo tree search to adjust playing strength and style. We first conduct experiments in Go, a challenging board game with a wide range of ranks. Our strength estimator significantly achieves over 80% accuracy in predicting ranks by observing 15 games only, whereas the previous method reached 49% accuracy for 100 games. For strength adjustment, SE-MCTS successfully adjusts to designated ranks while achieving a 51.33% accuracy in aligning to human actions, outperforming a previous state-of-the-art, with only 42.56% accuracy. To demonstrate the generality of our strength system, we further apply SE and SE-MCTS to chess and obtain consistent results. These results show a promising approach to strength estimation and adjustment, enhancing human-AI interactions in games. Our code is available at https://rlg.iis.sinica.edu.tw/papers/strength-estimator.

翻译：实力评估与调整在设计与人工智能交互中至关重要，特别是在AI超越人类玩家的游戏中。本文提出了一种新颖的实力系统，包括一个实力评估器（SE）和一种基于SE的蒙特卡洛树搜索，记为SE-MCTS，该系统可从游戏中预测实力，并提供具有人类风格的不同对弈实力。实力评估器无需直接人类交互，即可从游戏中计算实力分数并预测段位。SE-MCTS在蒙特卡洛树搜索中利用这些实力分数来调整对弈实力与风格。我们首先在围棋中进行了实验，这是一种具有广泛段位范围的挑战性棋盘游戏。我们的实力评估器仅通过观察15局游戏，就在段位预测上显著实现了超过80%的准确率，而先前的方法在100局游戏中仅达到49%的准确率。在实力调整方面，SE-MCTS成功调整至指定段位，同时在模仿人类动作上达到了51.33%的准确率，优于先前仅42.56%准确率的最先进方法。为了证明我们实力系统的通用性，我们进一步将SE和SE-MCTS应用于国际象棋，并得到了一致的结果。这些结果表明了一种有前景的实力评估与调整方法，能够增强游戏中的人机交互。我们的代码可在 https://rlg.iis.sinica.edu.tw/papers/strength-estimator 获取。