While transformers have gained the reputation as the "Swiss army knife of AI", no one has challenged them to master the game of chess, one of the classical AI benchmarks. Simply using vision transformers (ViTs) within AlphaZero does not master the game of chess, mainly because ViTs are too slow. Even making them more efficient using a combination of MobileNet and NextViT does not beat what actually matters: a simple change of the input representation and value loss, resulting in a greater boost of up to 180 Elo points over AlphaZero.
翻译:尽管Transformer被誉为“AI领域的瑞士军刀”,但尚未有人挑战它们掌握国际象棋这一经典AI基准游戏。在AlphaZero中直接使用视觉Transformer(ViT)无法掌握国际象棋,主要原因是ViT速度过慢。即使通过结合MobileNet和NextViT提高其效率,仍无法击败真正关键的要素:仅通过改变输入表示和价值损失函数,就能比AlphaZero提升高达180 Elo点。