行为克隆中的问题空间变换用于分布外泛化 (Problem Space Transformations for Out-of-Distribution Generalisation in Behavioural Cloning) - 专知论文

会员服务 ·

0

变换 · 泛化 · 操作 · 分布外泛化 · 机器人操作 ·

Problem Space Transformations for Out-of-Distribution Generalisation in Behavioural Cloning

翻译：行为克隆中的问题空间变换用于分布外泛化

Kiran Doshi,Marco Bagatella,Stelian Coros

The combination of behavioural cloning and neural networks has driven significant progress in robotic manipulation. As these algorithms may require a large number of demonstrations for each task of interest, they remain fundamentally inefficient in complex scenarios, in which finite datasets can hardly cover the state space. One of the remaining challenges is thus out-of-distribution (OOD) generalisation, i.e. the ability to predict correct actions for states with a low likelihood with respect to the state occupancy induced by the dataset. This issue is aggravated when the system to control is treated as a black-box, ignoring its physical properties. This work highlights widespread properties of robotic manipulation, specifically pose equivariance and locality. We investigate the effect of the choice of problem space on OOD performance of BC policies and how transformations arising from characteristic properties of manipulation can be employed for its improvement. Through controlled, simulated and real-world experiments, we empirically demonstrate that these transformations allow behaviour cloning policies, using either standard MLP-based one-step action prediction or diffusion-based action-sequence prediction, to generalise better to certain OOD problem instances. Code is available at https://github.com/kirandoshi/pst_ood_gen.

翻译：行为克隆与神经网络的结合推动了机器人操作领域的显著进展。由于这些算法可能需要对每个感兴趣的任务进行大量演示，因此在复杂场景中它们本质上仍然效率低下，因为有限的数据集难以覆盖整个状态空间。因此，剩余的挑战之一是分布外泛化，即预测在数据集诱导的状态占用分布中似然较低的状态下正确动作的能力。当被控系统被视为黑箱而忽略其物理特性时，这一问题会加剧。本研究重点探讨了机器人操作中普遍存在的特性，特别是姿态等变性和局部性。我们研究了问题空间的选择对行为克隆策略的OOD性能的影响，以及如何利用操作任务的特征属性所产生的变换来改进性能。通过受控的仿真和真实世界实验，我们经验性地证明，这些变换能使行为克隆策略——无论是使用标准的基于MLP的单步动作预测还是基于扩散的动作序列预测——在特定的OOD问题实例上实现更好的泛化。代码可在 https://github.com/kirandoshi/pst_ood_gen 获取。

0

相关内容

【博士论文】在低维和高维空间中分析、建模和转换潜在表征

【博士论文】在低维和高维空间中分析、建模和转换潜在表征

专知会员服务

18+阅读 · 2025年10月26日

深度学习中泛化的量化、理解与改进

深度学习中泛化的量化、理解与改进

专知会员服务

17+阅读 · 2025年9月13日

【博士论文】低维与高维空间中潜在表征的分析、建模与变换

【博士论文】低维与高维空间中潜在表征的分析、建模与变换

专知会员服务

22+阅读 · 2025年8月23日

【ETZH博士论文】低维与高维空间中潜在表示的分析、建模与变换，169页pdf

【ETZH博士论文】低维与高维空间中潜在表示的分析、建模与变换，169页pdf

专知会员服务

19+阅读 · 2025年7月30日

《时空变化领域中的学习与决策》134页

《时空变化领域中的学习与决策》134页

专知会员服务

16+阅读 · 2025年5月10日

【剑桥大学博士论文】机器学习中的分布外泛化，214页pdf

【剑桥大学博士论文】机器学习中的分布外泛化，214页pdf

专知会员服务

87+阅读 · 2023年9月13日

Transformer如何用到强化学习中? 清华等最新《Transformer强化学习》综述论文详述进展

Transformer如何用到强化学习中? 清华等最新《Transformer强化学习》综述论文详述进展

专知会员服务

106+阅读 · 2023年1月10日

【伯克利博士论文】学习跨领域的可迁移表示

【伯克利博士论文】学习跨领域的可迁移表示

专知会员服务

47+阅读 · 2022年8月17日

【干货书】《Transformers 机器学习:深度探究》，Transformers for Machine Learning A Deep Dive

【干货书】《Transformers 机器学习:深度探究》，Transformers for Machine Learning A Deep Dive

专知会员服务

473+阅读 · 2022年4月21日

分布外泛化(Out-Of-Distribution Generalization) 综述论文，22页pdf240篇文献

专知会员服务

64+阅读 · 2021年9月2日

《利用多模态移动传感器数据对健康进行建模的机器学习》剑桥大学博士论文

《利用多模态移动传感器数据对健康进行建模的机器学习》剑桥大学博士论文

专知

16+阅读 · 2022年5月3日

【干货书】《Transformers 机器学习:深度探究》，284页pdf

【干货书】《Transformers 机器学习:深度探究》，284页pdf

专知

72+阅读 · 2022年4月21日

【DeepMind】CrossTransformers: 空间感知的小样本迁移

【DeepMind】CrossTransformers: 空间感知的小样本迁移

专知

37+阅读 · 2020年7月26日

中科院发布最新迁移学习综述论文，带你全面了解40种迁移学习方法

中科院发布最新迁移学习综述论文，带你全面了解40种迁移学习方法

专知

48+阅读 · 2019年11月12日

TensorFlow 2.0官方Transformer教程 (Attention is All you Need)

TensorFlow 2.0官方Transformer教程 (Attention is All you Need)

专知

54+阅读 · 2019年4月12日

加入Transformer-XL，这个PyTorch包能调用各种NLP预训练模型

加入Transformer-XL，这个PyTorch包能调用各种NLP预训练模型

机器之心

15+阅读 · 2019年2月13日

分布式优化算法及其在多智能体系统与机器学习中的应用【附PPT与视频资料】

分布式优化算法及其在多智能体系统与机器学习中的应用【附PPT与视频资料】

人工智能前沿讲习班

21+阅读 · 2018年12月21日

Databricks 开源 MLflow 平台，解决机器学习开发四大难点

Databricks 开源 MLflow 平台，解决机器学习开发四大难点

AI研习社

13+阅读 · 2018年6月8日

迁移学习在深度学习中的应用

迁移学习在深度学习中的应用

专知

24+阅读 · 2017年12月24日

什么是迁移学习？它都用在深度学习的哪些场景上？这篇文章替你讲清楚了

什么是迁移学习？它都用在深度学习的哪些场景上？这篇文章替你讲清楚了

AI100

16+阅读 · 2017年12月23日

基于时空模式的复杂行为识别方法研究

国家自然科学基金

2+阅读 · 2017年12月31日

人类视空间分类的神经机制

国家自然科学基金

1+阅读 · 2015年12月31日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

主被动视角联合的细粒度行为识别

国家自然科学基金

1+阅读 · 2015年12月31日

空地机器人网络的同时视觉目标定位与分布式运动规划

国家自然科学基金

4+阅读 · 2015年12月31日

基于智能空间的云机器人行为知识驱动服务机制研究

国家自然科学基金

3+阅读 · 2015年12月31日

广域动态的野外环境中移动机器人六维全局定位方法的研究

国家自然科学基金

1+阅读 · 2015年12月31日

面向异分布数据的主动学习方法

国家自然科学基金

12+阅读 · 2015年12月31日

多语言大数据环境下的复杂网络行为分析、预测和干预

国家自然科学基金

4+阅读 · 2014年12月31日

强化学习关键技术及其在机器人行为学习中的应用

国家自然科学基金

23+阅读 · 2009年12月31日

DexEvolve: Evolutionary Optimization for Robust and Diverse Dexterous Grasp Synthesis

Arxiv

0+阅读 · 2月16日

Interpretability and Generalization Bounds for Learning Spatial Physics

Arxiv

0+阅读 · 2月9日

RFS: Reinforcement Learning with Residual Flow Steering for Dexterous Manipulation

Arxiv

0+阅读 · 2月5日

RFS: Reinforcement learning with Residual flow steering for dexterous manipulation

Arxiv

0+阅读 · 2月3日

RFS: Reinforcement learning with Residual flow steering for dexterous manipulation

Arxiv

0+阅读 · 2月2日

Flexible Multitask Learning with Factorized Diffusion Policy

Arxiv

0+阅读 · 2月1日

A Systematic Study of Data Modalities and Strategies for Co-training Large Behavior Models for Robot Manipulation

Arxiv

0+阅读 · 2月1日

Bridging the Gap Between Simulated and Real Network Data Using Transfer Learning

Arxiv

0+阅读 · 1月21日

Spatially Generalizable Mobile Manipulation via Adaptive Experience Selection and Dynamic Imagination

Arxiv

0+阅读 · 1月21日

Generalizable Domain Adaptation for Sim-and-Real Policy Co-Training

Arxiv

0+阅读 · 1月16日

VIP会员

文章信息

相关主题

分布外泛化

机器人操作

相关VIP内容

【博士论文】在低维和高维空间中分析、建模和转换潜在表征

【博士论文】在低维和高维空间中分析、建模和转换潜在表征

专知会员服务

18+阅读 · 2025年10月26日

深度学习中泛化的量化、理解与改进

深度学习中泛化的量化、理解与改进

专知会员服务

17+阅读 · 2025年9月13日

【博士论文】低维与高维空间中潜在表征的分析、建模与变换

【博士论文】低维与高维空间中潜在表征的分析、建模与变换

专知会员服务

22+阅读 · 2025年8月23日

【ETZH博士论文】低维与高维空间中潜在表示的分析、建模与变换，169页pdf

【ETZH博士论文】低维与高维空间中潜在表示的分析、建模与变换，169页pdf

专知会员服务

19+阅读 · 2025年7月30日

《时空变化领域中的学习与决策》134页

《时空变化领域中的学习与决策》134页

专知会员服务

16+阅读 · 2025年5月10日

【剑桥大学博士论文】机器学习中的分布外泛化，214页pdf

【剑桥大学博士论文】机器学习中的分布外泛化，214页pdf

专知会员服务

87+阅读 · 2023年9月13日

Transformer如何用到强化学习中? 清华等最新《Transformer强化学习》综述论文详述进展

Transformer如何用到强化学习中? 清华等最新《Transformer强化学习》综述论文详述进展

专知会员服务

106+阅读 · 2023年1月10日

【伯克利博士论文】学习跨领域的可迁移表示

【伯克利博士论文】学习跨领域的可迁移表示

专知会员服务

47+阅读 · 2022年8月17日

【干货书】《Transformers 机器学习:深度探究》，Transformers for Machine Learning A Deep Dive

【干货书】《Transformers 机器学习:深度探究》，Transformers for Machine Learning A Deep Dive

专知会员服务

473+阅读 · 2022年4月21日

分布外泛化(Out-Of-Distribution Generalization) 综述论文，22页pdf240篇文献

专知会员服务

64+阅读 · 2021年9月2日

热门VIP内容

开通专知VIP会员享更多权益服务

智能体记忆深度剖析：评价指标与系统局限性的分类体系及实证分析

《可信人工智能赋能系统的支柱》

【CMU博士论文】可靠轨迹预测的分层基石：数据、评估与方法

人工智能赋能边缘与自主系统：美陆军现代化进程聚焦威胁探测与战术边缘情报

相关资讯

《利用多模态移动传感器数据对健康进行建模的机器学习》剑桥大学博士论文

《利用多模态移动传感器数据对健康进行建模的机器学习》剑桥大学博士论文

专知

16+阅读 · 2022年5月3日

【干货书】《Transformers 机器学习:深度探究》，284页pdf

【干货书】《Transformers 机器学习:深度探究》，284页pdf

专知

72+阅读 · 2022年4月21日

【DeepMind】CrossTransformers: 空间感知的小样本迁移

【DeepMind】CrossTransformers: 空间感知的小样本迁移

专知

37+阅读 · 2020年7月26日

中科院发布最新迁移学习综述论文，带你全面了解40种迁移学习方法

中科院发布最新迁移学习综述论文，带你全面了解40种迁移学习方法

专知

48+阅读 · 2019年11月12日

TensorFlow 2.0官方Transformer教程 (Attention is All you Need)

TensorFlow 2.0官方Transformer教程 (Attention is All you Need)

专知

54+阅读 · 2019年4月12日

加入Transformer-XL，这个PyTorch包能调用各种NLP预训练模型

加入Transformer-XL，这个PyTorch包能调用各种NLP预训练模型

机器之心

15+阅读 · 2019年2月13日

分布式优化算法及其在多智能体系统与机器学习中的应用【附PPT与视频资料】

分布式优化算法及其在多智能体系统与机器学习中的应用【附PPT与视频资料】

人工智能前沿讲习班

21+阅读 · 2018年12月21日

Databricks 开源 MLflow 平台，解决机器学习开发四大难点

Databricks 开源 MLflow 平台，解决机器学习开发四大难点

AI研习社

13+阅读 · 2018年6月8日

迁移学习在深度学习中的应用

迁移学习在深度学习中的应用

专知

24+阅读 · 2017年12月24日

什么是迁移学习？它都用在深度学习的哪些场景上？这篇文章替你讲清楚了

什么是迁移学习？它都用在深度学习的哪些场景上？这篇文章替你讲清楚了

AI100

16+阅读 · 2017年12月23日

相关论文

DexEvolve: Evolutionary Optimization for Robust and Diverse Dexterous Grasp Synthesis

Arxiv

0+阅读 · 2月16日

Interpretability and Generalization Bounds for Learning Spatial Physics

Arxiv

0+阅读 · 2月9日

RFS: Reinforcement Learning with Residual Flow Steering for Dexterous Manipulation

Arxiv

0+阅读 · 2月5日

RFS: Reinforcement learning with Residual flow steering for dexterous manipulation

Arxiv

0+阅读 · 2月3日

RFS: Reinforcement learning with Residual flow steering for dexterous manipulation

Arxiv

0+阅读 · 2月2日

Flexible Multitask Learning with Factorized Diffusion Policy

Arxiv

0+阅读 · 2月1日

A Systematic Study of Data Modalities and Strategies for Co-training Large Behavior Models for Robot Manipulation

Arxiv

0+阅读 · 2月1日

Bridging the Gap Between Simulated and Real Network Data Using Transfer Learning

Arxiv

0+阅读 · 1月21日

Spatially Generalizable Mobile Manipulation via Adaptive Experience Selection and Dynamic Imagination

Arxiv

0+阅读 · 1月21日

Generalizable Domain Adaptation for Sim-and-Real Policy Co-Training

Arxiv

0+阅读 · 1月16日

相关基金

基于时空模式的复杂行为识别方法研究

国家自然科学基金

2+阅读 · 2017年12月31日

人类视空间分类的神经机制

国家自然科学基金

1+阅读 · 2015年12月31日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

主被动视角联合的细粒度行为识别

国家自然科学基金

1+阅读 · 2015年12月31日

空地机器人网络的同时视觉目标定位与分布式运动规划

国家自然科学基金

4+阅读 · 2015年12月31日

基于智能空间的云机器人行为知识驱动服务机制研究

国家自然科学基金

3+阅读 · 2015年12月31日

广域动态的野外环境中移动机器人六维全局定位方法的研究

国家自然科学基金

1+阅读 · 2015年12月31日

面向异分布数据的主动学习方法

国家自然科学基金

12+阅读 · 2015年12月31日

多语言大数据环境下的复杂网络行为分析、预测和干预

国家自然科学基金

4+阅读 · 2014年12月31日

强化学习关键技术及其在机器人行为学习中的应用

国家自然科学基金

23+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员