We present the Multilingual Reasoning Gym, an extension of Reasoning Gym (Stojanovski et al., 2025) that procedurally generates verifiable reasoning problems in 14 languages. We translate the templates for 94 tasks, with native-speaker validation in 10 languages and targeted code or template adaptations to ensure linguistic naturalness. The Multilingual Reasoning Gym preserves the core benefits of the original Reasoning Gym's procedural generation, such as virtually unlimited problem instances and adjustable difficulty, and remains directly usable for Reinforcement Learning from Verifiable Rewards and for evaluation. Problems are parallel across languages, and because the environments are procedural, cross-lingually parallel data can be generated at massive scale. We release our implementation to support research into multilingual reasoning models.
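To illustrate how procedural generation can yield problems that are parallel across languages, the following is a minimal sketch and not the released implementation. It assumes a toy addition task with hand-written templates; the names `TEMPLATES`, `generate`, and `score` are hypothetical and are not taken from Reasoning Gym. The idea shown is that a shared seed fixes the problem content, while the language only selects the surface template, so the verifiable answer is identical in every language.

```python
import random

# Hypothetical per-language templates for one toy task (illustration only).
TEMPLATES = {
    "en": "What is {a} + {b}?",
    "de": "Was ist {a} + {b}?",
    "es": "¿Cuánto es {a} + {b}?",
}


def generate(seed: int, lang: str, max_value: int = 100) -> dict:
    """Generate one problem instance.

    The seed determines the operands; the language only picks the template.
    max_value acts as a simple difficulty knob.
    """
    rng = random.Random(seed)
    a = rng.randint(1, max_value)
    b = rng.randint(1, max_value)
    return {
        "question": TEMPLATES[lang].format(a=a, b=b),
        "answer": str(a + b),
    }


def score(entry: dict, model_answer: str) -> float:
    """Verifiable reward: 1.0 for an exact match with the reference answer."""
    return 1.0 if model_answer.strip() == entry["answer"] else 0.0


# The same seed yields the same underlying problem in every language.
parallel = {lang: generate(seed=42, lang=lang) for lang in TEMPLATES}
```

Under these assumptions, varying the seed gives an effectively unlimited supply of instances, and each instance exists in every supported language with one shared reference answer, which is what makes the resulting data parallel.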