Knowledge Reutilization in Meta-Reinforcement Learning - 专知论文

会员服务 ·

0

知识 · 重用 · 元强化学习 · 耦合 · 非参数 ·

Knowledge Reutilization in Meta-Reinforcement Learning

翻译：元强化学习中的知识重用

Yuan Meng,Bo Wang,Juan de los Rios Ruiz,Xiangtong Yao,Zhenshan Bing,Fuchun Sun,Alois Knoll

from arxiv, 18 pages initial submission

Meta-reinforcement learning enables fast adaptation by extracting shared structure from related tasks, but existing end-to-end methods often couple task inference with embodiment-specific control. This coupling can obscure non-parametric task semantics, reduce sample efficiency, and limit cross-agent reuse. We propose a meta-knowledge reutilization framework that learns task-level knowledge on a dynamics-simplified agent and transfers it to heterogeneous agents. The framework uses a Bayesian non-parametric prior to organize latent task modes and a high-level policy to generate task-level magnitude guidance. To bridge reusable task knowledge with different embodiments, we introduce a semantic-magnitude interface and a lightweight temporal adaptor, which convert frozen meta-knowledge into temporally aligned subgoals for embodiment-specific low-level controllers. Experiments on multiple locomotion agents show that our framework reduces final-step tracking error by 94.75% -- 99.79% compared with recent state-of-the-art baselines and achieves comparable deployment performance with about 23.8% of their interaction data.

翻译：元强化学习通过从相关任务中提取共享结构实现快速适应，但现有端到端方法常将任务推理与具身特定控制耦合。这种耦合可能掩盖非参数化任务语义、降低样本效率并限制跨智能体重用。我们提出一种元知识重用框架，该框架在动力学简化智能体上学习任务级知识，并将其迁移至异构智能体。该框架采用贝叶斯非参数先验组织潜在任务模式，并通过高层策略生成任务级幅度引导。为桥接可重用任务知识与不同具身形态，我们引入语义-幅度接口与轻量级时序适配器，将冻结的元知识转换为具身特定低级控制器所需的时序对齐子目标。在多运动智能体上的实验表明，相较于近期最先进基线方法，本框架将最终步长跟踪误差降低94.75%-99.79%，且仅需约23.8%的交互数据即可达到相当的部署性能。

0

相关内容

【HKUST博士论文】复杂任务下的元学习

【HKUST博士论文】复杂任务下的元学习

专知会员服务

24+阅读 · 2025年1月14日

《元强化学习》最新，70页ppt

《元强化学习》最新，70页ppt

专知会员服务

83+阅读 · 2022年9月16日

【ICML2022】Transformer是元强化学习器

【ICML2022】Transformer是元强化学习器

专知会员服务

56+阅读 · 2022年6月15日

【2022新书】元学习(Meta Learning ): 自动机器学习与数据挖掘，Metalearning: Applications to Automated Machine Learning and Data Mining

【2022新书】元学习(Meta Learning ): 自动机器学习与数据挖掘，Metalearning: Applications to Automated Machine Learning and Data Mining

专知会员服务

159+阅读 · 2022年3月7日

【ICML2021】授权驱动探索的元强化学习

专知会员服务

28+阅读 · 2021年5月24日

「元强化学习」报告，斯坦福Chelsea Finn讲解，52页ppt，Meta Reinforcement Learning

「元强化学习」报告，斯坦福Chelsea Finn讲解，52页ppt，Meta Reinforcement Learning

专知会员服务

43+阅读 · 2021年1月11日

元学习(meta learning) 最新进展综述论文

元学习(meta learning) 最新进展综述论文

专知会员服务

281+阅读 · 2020年5月8日

【普林斯顿大学-微软】加权元学习，Weighted Meta-Learning

【普林斯顿大学-微软】加权元学习，Weighted Meta-Learning

专知会员服务

40+阅读 · 2020年3月25日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【得克萨斯大学奥斯汀分校】无记忆元学习（Meta-Learning without Memorization），Mingzhang Yin，Mingyuan Zhou

【得克萨斯大学奥斯汀分校】无记忆元学习（Meta-Learning without Memorization），Mingzhang Yin，Mingyuan Zhou

专知会员服务

10+阅读 · 2019年12月12日

【2022新书】元学习(Meta Learning ): 自动机器学习与数据挖掘

【2022新书】元学习(Meta Learning ): 自动机器学习与数据挖掘

专知

21+阅读 · 2022年3月7日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning，33页ppt

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning，33页ppt

专知

72+阅读 · 2020年2月29日

最近必读的六篇【Meta-Learning（元学习）】相关论文和代码

最近必读的六篇【Meta-Learning（元学习）】相关论文和代码

专知

61+阅读 · 2019年11月3日

元学习—Meta Learning的兴起

元学习—Meta Learning的兴起

专知

44+阅读 · 2019年10月19日

近期必读的八篇【Meta-Learning（元学习）】相关论文和代码

近期必读的八篇【Meta-Learning（元学习）】相关论文和代码

专知

134+阅读 · 2019年9月15日

元学习（Meta Learning）最全论文、视频、书籍资源整理

元学习（Meta Learning）最全论文、视频、书籍资源整理

深度学习与NLP

22+阅读 · 2019年6月20日

元学习(Meta-Learning) 综述及五篇顶会论文推荐

元学习(Meta-Learning) 综述及五篇顶会论文推荐

专知

194+阅读 · 2019年4月14日

【资源推荐】元学习（meta-learning）相关文献资源大列表

【资源推荐】元学习（meta-learning）相关文献资源大列表

专知

25+阅读 · 2019年3月6日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Meta-Learning 元学习：学会快速学习

Meta-Learning 元学习：学会快速学习

GAN生成式对抗网络

20+阅读 · 2018年12月8日

针对大规模环境下复杂任务的策略搜索强化学习方法研究

国家自然科学基金

43+阅读 · 2015年12月31日

基于复杂图知识表示的终身强化学习研究

国家自然科学基金

41+阅读 · 2015年12月31日

基于重要性采样的并行离策略强化学习方法研究

国家自然科学基金

24+阅读 · 2015年12月31日

公共组织跨部门知识共享机理、绩效激励与实现机制重塑研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于极限学习单元的多生物特征图像深度学习建模与识别研究

国家自然科学基金

2+阅读 · 2015年12月31日

面向大规模多步学习问题的学习分类元系统技术研究

国家自然科学基金

5+阅读 · 2015年12月31日

面向大数据的知识表示、推理、在线学习理论及应用研究

国家自然科学基金

12+阅读 · 2014年12月31日

强化学习关键技术及其在机器人行为学习中的应用

国家自然科学基金

23+阅读 · 2009年12月31日

基于多智能体强化学习的多机器人系统研究

国家自然科学基金

50+阅读 · 2009年12月31日

基于支持向量机的复杂连续系统强化学习控制研究

国家自然科学基金

12+阅读 · 2008年12月31日

Reference Architecture for Metadata-driven Services to Promote Reusability in Software Systems

Arxiv

0+阅读 · 6月15日

Meta-Learning Transformers to Improve In-Context Generalization

Arxiv

0+阅读 · 6月11日

Learning to Adapt: Representation-Based Reinforcement Learning for Multi-Task Skill Transfer

Arxiv

0+阅读 · 6月11日

ReSkill: Reconciling Skill Creation with Policy Optimization in Agentic RL

Arxiv

0+阅读 · 6月8日

Exact Unlearning in Reinforcement Learning

Arxiv

0+阅读 · 6月2日

Answer-Set-Programming-based Abstractions for Reinforcement Learning

Arxiv

0+阅读 · 5月29日

A Taxonomy of Metacognitive Learning Scenarios in Professional Contexts: Integrating Systems Theory with Empirical Constraints

Arxiv

0+阅读 · 5月22日

R$^3$L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification

Arxiv

0+阅读 · 5月22日

General Preference Reinforcement Learning

Arxiv

0+阅读 · 5月18日

MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration

Arxiv

12+阅读 · 2021年2月7日

VIP会员

文章信息

相关主题

元强化学习

最新内容

从采集到决策：美军视角下的战术情报范式重构

从采集到决策：美军视角下的战术情报范式重构

专知会员服务

1+阅读 · 今天2:42

乌克兰“德尔塔”系统揭示无人机、数据与领导力如何重塑现代安全格局

乌克兰“德尔塔”系统揭示无人机、数据与领导力如何重塑现代安全格局

专知会员服务

1+阅读 · 今天2:37

大规模作战中的参谋流程：作为联合兵种作战组成部分的目标锁定

大规模作战中的参谋流程：作为联合兵种作战组成部分的目标锁定

专知会员服务

2+阅读 · 今天2:23

《北约概念开发与实验（CD&E）手册：概念开发者工具箱》100页手册

《北约概念开发与实验（CD&E）手册：概念开发者工具箱》100页手册

专知会员服务

5+阅读 · 今天2:21

《履带式无人地面战车技术发展现状》

《履带式无人地面战车技术发展现状》

专知会员服务

2+阅读 · 今天1:46

《美国空军B-2“幽灵”隐身轰炸机系统工程案例研究》117页

《美国空军B-2“幽灵”隐身轰炸机系统工程案例研究》117页

专知会员服务

5+阅读 · 8月1日

隐身技术前沿综述：物理机理、工程实践与战略展望

隐身技术前沿综述：物理机理、工程实践与战略展望

专知会员服务

4+阅读 · 8月1日

《多变海洋环境下无人水面艇与自主水下机器人对接的最优路径规划》

《多变海洋环境下无人水面艇与自主水下机器人对接的最优路径规划》

专知会员服务

3+阅读 · 8月1日

《以机反机：基于无人机载麦克风的空中周界入侵检测》

《以机反机：基于无人机载麦克风的空中周界入侵检测》

专知会员服务

4+阅读 · 8月1日

《无人机脆弱性利用：网络空间力量的新域》

《无人机脆弱性利用：网络空间力量的新域》

专知会员服务

2+阅读 · 8月1日

美空军如何将人工智能从战场部署至后方机关

美空军如何将人工智能从战场部署至后方机关

专知会员服务

11+阅读 · 7月31日

《美战争部指令文件：网络空间效应与使能能力测试评估》

《美战争部指令文件：网络空间效应与使能能力测试评估》

专知会员服务

8+阅读 · 7月31日

《史诗怒火行动：多域前瞻评估》49页报告

《史诗怒火行动：多域前瞻评估》49页报告

专知会员服务

7+阅读 · 7月31日

《英国防部：未来空战系统数字化战略》33页

《英国防部：未来空战系统数字化战略》33页

专知会员服务

5+阅读 · 7月31日

《面向自主飞行网络的智能体人工智能架构》

《面向自主飞行网络的智能体人工智能架构》

专知会员服务

7+阅读 · 7月31日

相关VIP内容

【HKUST博士论文】复杂任务下的元学习

【HKUST博士论文】复杂任务下的元学习

专知会员服务

24+阅读 · 2025年1月14日

《元强化学习》最新，70页ppt

《元强化学习》最新，70页ppt

专知会员服务

83+阅读 · 2022年9月16日

【ICML2022】Transformer是元强化学习器

【ICML2022】Transformer是元强化学习器

专知会员服务

56+阅读 · 2022年6月15日

【2022新书】元学习(Meta Learning ): 自动机器学习与数据挖掘，Metalearning: Applications to Automated Machine Learning and Data Mining

【2022新书】元学习(Meta Learning ): 自动机器学习与数据挖掘，Metalearning: Applications to Automated Machine Learning and Data Mining

专知会员服务

159+阅读 · 2022年3月7日

【ICML2021】授权驱动探索的元强化学习

专知会员服务

28+阅读 · 2021年5月24日

「元强化学习」报告，斯坦福Chelsea Finn讲解，52页ppt，Meta Reinforcement Learning

「元强化学习」报告，斯坦福Chelsea Finn讲解，52页ppt，Meta Reinforcement Learning

专知会员服务

43+阅读 · 2021年1月11日

元学习(meta learning) 最新进展综述论文

元学习(meta learning) 最新进展综述论文

专知会员服务

281+阅读 · 2020年5月8日

【普林斯顿大学-微软】加权元学习，Weighted Meta-Learning

【普林斯顿大学-微软】加权元学习，Weighted Meta-Learning

专知会员服务

40+阅读 · 2020年3月25日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【得克萨斯大学奥斯汀分校】无记忆元学习（Meta-Learning without Memorization），Mingzhang Yin，Mingyuan Zhou

【得克萨斯大学奥斯汀分校】无记忆元学习（Meta-Learning without Memorization），Mingzhang Yin，Mingyuan Zhou

专知会员服务

10+阅读 · 2019年12月12日

热门VIP内容

开通专知VIP会员享更多权益服务

乌克兰“德尔塔”系统揭示无人机、数据与领导力如何重塑现代安全格局

《北约概念开发与实验（CD&E）手册：概念开发者工具箱》100页手册

从采集到决策：美军视角下的战术情报范式重构

大规模作战中的参谋流程：作为联合兵种作战组成部分的目标锁定

相关资讯

【2022新书】元学习(Meta Learning ): 自动机器学习与数据挖掘

【2022新书】元学习(Meta Learning ): 自动机器学习与数据挖掘

专知

21+阅读 · 2022年3月7日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning，33页ppt

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning，33页ppt

专知

72+阅读 · 2020年2月29日

最近必读的六篇【Meta-Learning（元学习）】相关论文和代码

最近必读的六篇【Meta-Learning（元学习）】相关论文和代码

专知

61+阅读 · 2019年11月3日

元学习—Meta Learning的兴起

元学习—Meta Learning的兴起

专知

44+阅读 · 2019年10月19日

近期必读的八篇【Meta-Learning（元学习）】相关论文和代码

近期必读的八篇【Meta-Learning（元学习）】相关论文和代码

专知

134+阅读 · 2019年9月15日

元学习（Meta Learning）最全论文、视频、书籍资源整理

元学习（Meta Learning）最全论文、视频、书籍资源整理

深度学习与NLP

22+阅读 · 2019年6月20日

元学习(Meta-Learning) 综述及五篇顶会论文推荐

元学习(Meta-Learning) 综述及五篇顶会论文推荐

专知

194+阅读 · 2019年4月14日

【资源推荐】元学习（meta-learning）相关文献资源大列表

【资源推荐】元学习（meta-learning）相关文献资源大列表

专知

25+阅读 · 2019年3月6日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Meta-Learning 元学习：学会快速学习

Meta-Learning 元学习：学会快速学习

GAN生成式对抗网络

20+阅读 · 2018年12月8日

相关论文

Reference Architecture for Metadata-driven Services to Promote Reusability in Software Systems

Arxiv

0+阅读 · 6月15日

Meta-Learning Transformers to Improve In-Context Generalization

Arxiv

0+阅读 · 6月11日

Learning to Adapt: Representation-Based Reinforcement Learning for Multi-Task Skill Transfer

Arxiv

0+阅读 · 6月11日

ReSkill: Reconciling Skill Creation with Policy Optimization in Agentic RL

Arxiv

0+阅读 · 6月8日

Exact Unlearning in Reinforcement Learning

Arxiv

0+阅读 · 6月2日

Answer-Set-Programming-based Abstractions for Reinforcement Learning

Arxiv

0+阅读 · 5月29日

A Taxonomy of Metacognitive Learning Scenarios in Professional Contexts: Integrating Systems Theory with Empirical Constraints

Arxiv

0+阅读 · 5月22日

R$^3$L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification

Arxiv

0+阅读 · 5月22日

General Preference Reinforcement Learning

Arxiv

0+阅读 · 5月18日

MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration

Arxiv

12+阅读 · 2021年2月7日

相关基金

针对大规模环境下复杂任务的策略搜索强化学习方法研究

国家自然科学基金

43+阅读 · 2015年12月31日

基于复杂图知识表示的终身强化学习研究

国家自然科学基金

41+阅读 · 2015年12月31日

基于重要性采样的并行离策略强化学习方法研究

国家自然科学基金

24+阅读 · 2015年12月31日

公共组织跨部门知识共享机理、绩效激励与实现机制重塑研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于极限学习单元的多生物特征图像深度学习建模与识别研究

国家自然科学基金

2+阅读 · 2015年12月31日

面向大规模多步学习问题的学习分类元系统技术研究

国家自然科学基金

5+阅读 · 2015年12月31日

面向大数据的知识表示、推理、在线学习理论及应用研究

国家自然科学基金

12+阅读 · 2014年12月31日

强化学习关键技术及其在机器人行为学习中的应用

国家自然科学基金

23+阅读 · 2009年12月31日

基于多智能体强化学习的多机器人系统研究

国家自然科学基金

50+阅读 · 2009年12月31日

基于支持向量机的复杂连续系统强化学习控制研究

国家自然科学基金

12+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员