Imitation learning is a popular paradigm for teaching robots new tasks, but collecting robot demonstrations through teleoperation or kinesthetic teaching is tedious and time-consuming. In contrast, directly demonstrating a task with our own human embodiment is much easier and such data is abundantly available, yet transferring it to the robot is non-trivial. In this work, we propose Real2Gen, which trains a manipulation policy from a single human demonstration. Real2Gen extracts the required information from the demonstration and transfers it to a simulation environment, where a programmable expert agent can demonstrate the task arbitrarily many times, generating unlimited data for training a flow matching policy. We evaluate Real2Gen on human demonstrations of three different real-world tasks and compare it to a recent baseline. Real2Gen achieves an average increase in success rate of 26.6% and better generalization of the trained policy, owing to the abundance and diversity of the training data. We further deploy our purely simulation-trained policy zero-shot in the real world. We make the data, code, and trained models publicly available at real2gen.cs.uni-freiburg.de.