Diffusion Models for Reinforcement Learning: A Survey

Diffusion models have emerged as a prominent class of generative models, surpassing previous methods regarding sample quality and training stability. Recent works have shown the advantages of diffusion models in improving reinforcement learning (RL) solutions, including as trajectory planners, expressive policy classes, data synthesizers, etc. This survey aims to provide an overview of the advancements in this emerging field and hopes to inspire new avenues of research. First, we examine several challenges encountered by current RL algorithms. Then, we present a taxonomy of existing methods based on the roles played by diffusion models in RL and explore how the existing challenges are addressed. We further outline successful applications of diffusion models in various RL-related tasks while discussing the limitations of current approaches. Finally, we conclude the survey and offer insights into future research directions, focusing on enhancing model performance and applying diffusion models to broader tasks. We are actively maintaining a GitHub repository for papers and other related resources in applying diffusion models in RL: https://github.com/apexrl/Diff4RLSurvey .

翻译：扩散模型已崛起为生成模型中的突出类别，在样本质量和训练稳定性方面超越了先前方法。近期研究展示了扩散模型在改进强化学习（RL）解决方案中的优势，包括作为轨迹规划器、表达性策略类、数据合成器等。本综述旨在概述这一新兴领域的进展，并期望激发新的研究方向。首先，我们考察了当前RL算法面临的若干挑战。随后，我们基于扩散模型在RL中扮演的角色提出了现有方法的分类体系，并探讨了现有挑战是如何被解决的。我们进一步概述了扩散模型在各种RL相关任务中的成功应用，同时讨论了当前方法的局限性。最后，我们总结综述并展望未来研究方向，重点关注提升模型性能及将扩散模型应用于更广泛任务。我们正在积极维护一个GitHub仓库，收录扩散模型在RL中应用的相关论文及其他资源：https://github.com/apexrl/Diff4RLSurvey 。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/