Mobile Manipulation (MM) involves long-horizon decision-making over multi-stage compositions of heterogeneous skills, such as navigation and object picking. Despite recent progress, existing MM methods still face two key limitations: (i) low sample efficiency, caused by ineffective use of the redundant data generated during long-horizon MM interactions; and (ii) poor spatial generalization, as policies trained on specific tasks struggle to transfer to new spatial layouts without additional training. In this paper, we address these challenges through Adaptive Experience Selection (AES) and model-based dynamic imagination. Specifically, AES directs MM agents toward the critical experience fragments in long trajectories that determine task success, improving skill-chain learning and mitigating skill forgetting. Building on AES, a Recurrent State-Space Model (RSSM) is introduced for Model-Predictive Forward Planning (MPFP): it captures the coupled dynamics between the mobile base and the manipulator and imagines the dynamics of future manipulations. RSSM-based MPFP reinforces MM skill learning on the current task while enabling effective generalization to new spatial layouts. Comparative studies across different experimental configurations demonstrate that our method significantly outperforms existing MM policies, and real-world experiments further validate its feasibility and practicality.
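The abstract does not specify how AES scores experience fragments, but the core idea of weighting replay toward success-critical fragments can be sketched in the style of prioritized experience replay. The function below, its name, and the `alpha` temperature parameter are illustrative assumptions, not the paper's actual algorithm:

```python
import random

def select_experiences(fragments, scores, k, alpha=0.6):
    """Sample k experience fragments, favoring those with high
    task-criticality scores (hypothetical scoring; AES's actual
    criterion is defined in the paper, not here)."""
    # Convert raw scores to sampling priorities (prioritized-replay style);
    # alpha controls how strongly high-scoring fragments are favored.
    priorities = [s ** alpha for s in scores]
    total = sum(priorities)
    weights = [p / total for p in priorities]
    # Sample indices without replacement, weighted by priority.
    chosen = []
    available = list(range(len(fragments)))
    for _ in range(min(k, len(fragments))):
        pos = random.choices(range(len(available)), weights=weights, k=1)[0]
        chosen.append(fragments[available.pop(pos)])
        weights.pop(pos)
    return chosen
```

Under this sketch, fragments near skill transitions (e.g. the hand-off from navigation to grasping) would simply receive higher scores and thus dominate the replay batch.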