Visual Reinforcement Learning (Visual RL), coupled with high-dimensional observations, has consistently confronted the long-standing challenge of out-of-distribution generalization. Despite the focus on algorithms aimed at resolving visual generalization problems, we argue that the devil is in the existing benchmarks as they are restricted to isolated tasks and generalization categories, undermining a comprehensive evaluation of agents' visual generalization capabilities. To bridge this gap, we introduce RL-ViGen: a novel Reinforcement Learning Benchmark for Visual Generalization, which contains diverse tasks and a wide spectrum of generalization types, thereby facilitating the derivation of more reliable conclusions. Furthermore, RL-ViGen incorporates the latest generalization visual RL algorithms into a unified framework, under which the experiment results indicate that no single existing algorithm has prevailed universally across tasks. Our aspiration is that RL-ViGen will serve as a catalyst in this area, and lay a foundation for the future creation of universal visual generalization RL agents suitable for real-world scenarios. Access to our code and implemented algorithms is provided at https://gemcollector.github.io/RL-ViGen/.
翻译:视觉强化学习(Visual RL)结合高维观测数据,一直面临分布外泛化这一长期挑战。尽管目前研究聚焦于解决视觉泛化问题的算法,但我们认为问题根源在于现有基准受限于孤立任务与泛化类别,难以全面评估智能体的视觉泛化能力。为弥补这一空白,我们提出RL-ViGen:一种新颖的视觉泛化强化学习基准,其包含多样化任务与广泛泛化类型,从而有助于得出更可靠的结论。此外,RL-ViGen将最新的视觉泛化强化学习算法整合至统一框架中,实验结果表明,尚无单一算法能在所有任务中普遍取得优势。我们期望RL-ViGen能成为该领域的催化剂,为未来创建适用于真实场景的通用视觉泛化强化学习智能体奠定基础。相关代码与已实现算法可通过https://gemcollector.github.io/RL-ViGen/获取。