By framing reinforcement learning as a sequence modeling problem, recent work has enabled the use of generative models, such as diffusion models, for planning. While these models are effective in predicting long-horizon state trajectories in deterministic environments, they face challenges in dynamic settings with moving obstacles. Effective collision avoidance demands continuous monitoring and adaptive decision-making. While replanning at every timestep could ensure safety, it introduces substantial computational overhead due to the repetitive prediction of overlapping state sequences -- a process that is particularly costly with diffusion models, known for their intensive iterative sampling procedure. We propose an adaptive generative planning approach that dynamically adjusts replanning frequency based on the uncertainty of action predictions. Our method minimizes the need for frequent, computationally expensive, and redundant replanning while maintaining robust collision avoidance performance. In experiments, we obtain a 13.5% increase in the mean trajectory length and a 12.7% increase in mean reward over long-horizon planning, indicating a reduction in collision rates and an improved ability to navigate the environment safely.
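The core control loop described above — execute a cached plan and replan only when the uncertainty of the predicted actions grows too large — can be sketched as follows. This is a minimal illustration, not the paper's implementation: `sample_plans` is a hypothetical stand-in for a diffusion planner, and sample variance across candidate action sequences is used as a simple uncertainty proxy.

```python
import numpy as np

def sample_plans(state, horizon, n_samples, rng):
    # Hypothetical stand-in for a generative (e.g., diffusion) planner:
    # returns n_samples candidate action sequences, shape (n_samples, horizon).
    return rng.normal(loc=np.tanh(state), scale=0.1, size=(n_samples, horizon))

def adaptive_replan(env_step, state, total_steps, threshold=0.05,
                    horizon=16, n_samples=8, seed=0):
    """Execute a cached plan, replanning only when the variance of the
    next planned action (an uncertainty proxy) exceeds the threshold,
    or when the cached plan is exhausted."""
    rng = np.random.default_rng(seed)
    plan, t, replans = None, 0, 0
    for _ in range(total_steps):
        need_plan = plan is None or t >= plan.shape[1]
        if not need_plan:
            # Uncertainty proxy: disagreement among sampled plans at step t.
            need_plan = plan[:, t].var() > threshold
        if need_plan:
            plan, t = sample_plans(state, horizon, n_samples, rng), 0
            replans += 1
        action = plan[:, t].mean()  # execute the consensus action
        state = env_step(state, action)
        t += 1
    return state, replans
```

With a low-variance planner, replanning triggers mainly when the cached plan runs out, so the expensive sampling step runs far less often than once per timestep; raising the planner's noise (or lowering `threshold`) pushes the loop toward per-step replanning.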