Reinforcement learning in linear embedding space unlocks generalizable control across soft robot configurations

Xinglong Zhang,Cong Li,Hangjie Mo,Yue Jiang,Xin Xu,Wei Jiang,Zhenshan Bing,Yihe Yang,Xiaojian Li,Yueneng Yang,Huimin Lu,Ling-li Zeng,Alois Knoll,Dewen Hu,Li Wen,Wei Pan

from arxiv, An updated version of this paper has been accepted by Nature Communications

Soft-bodied organisms such as octopuses and elephant trunks exhibit remarkable morphological adaptability, dynamically reconfiguring body shape and stiffness, and flexibly adjusting their control strategies to enable versatile behaviors. Inspired by these biological systems, various soft robots have emerged in recent decades, featuring diverse materials, stiffnesses, and morphologies tailored to specific tasks. Despite substantial advances in the materials and structural designs of soft robots, developing a generalizable control framework capable of rapid adaptation across diverse configurations remains a long-standing challenge. Existing controllers are limited to fixed configurations, demanding laborious configuration-specific remodelling and policy redesign for new configurations. Here, we introduce a generalizable control system that enables rapid adaptation across diverse soft robot configurations via reinforcement learning in a shared linear Koopman embedding space. By encoding robot dynamics into this embedding space, our method decouples control policies from specific morphologies, allowing real-time, model-free policy adaptation across diverse configurations without retraining from scratch. We validate our system across 33 distinct robot configurations. Our system achieves a 75 times reduction in transfer samples across configurations, while sustaining robust performance under high-speed motion, heavy payloads, and multiactuator faults, and achieving real-world skills previously unattainable in soft robotics. This work establishes a unified and adaptable control paradigm for diverse soft robot configurations, bridging mechanical reconfigurability with control flexibility, and may offer broader insights for generalizable control in complex physical systems.

翻译：软体生物如章鱼和象鼻展现出卓越的形态适应性，能够动态重构身体形状与刚度，并灵活调整控制策略以实现多样化的行为。受这些生物系统启发，近几十年来涌现出多种软体机器人，其采用针对特定任务定制的不同材料、刚度和形态。尽管软体机器人的材料与结构设计取得了重大进展，但开发一个能够跨多种构型快速适应的泛化控制框架仍是长期存在的挑战。现有控制器局限于固定构型，需要针对新构型进行耗时的特定构型重构和策略再设计。本文提出一种泛化控制系统，通过在线性Koopman嵌入空间中进行强化学习，实现跨多种软体机器人构型的快速适应。通过将机器人动力学编码至该嵌入空间，我们的方法将控制策略与具体形态解耦，使得无需从头重新训练即可在多样构型间实现实时无模型策略适应。我们在33种不同机器人构型上验证了该系统，其跨构型的迁移样本量减少了75倍，同时在高速度运动、重载荷及多执行器故障条件下保持稳健性能，并实现了软体机器人领域此前难以企及的真实世界技能。本工作为多样软体机器人构型建立了统一且可适应的控制范式，将机械可重构性与控制灵活性相衔接，或可为复杂物理系统中的泛化控制提供更广泛的洞见。

相关内容

软体机器人

关注 3

软体机器人是一种新型柔软机器人，能够适应各种非结构化环境，与人类的交互也更安全。机器人本体利用柔软材料制作，一般认为是杨氏模量低于人类肌肉的材料；区别于传统机器人电机驱动，软体机器人的驱动方式主要取决于所使用的智能材料；一般有介电弹性体（DE）、离子聚合物金属复合材料（IPMC）、形状记忆合金（SMA）、形状记忆聚合物（SMP）等等，从响应的物理量暂时分为如下几类：电场、压力、磁场、化学反应、光、温度。科学家依此设计了各种各样的软体机器人，大多数软体机器人的设计是模仿自然界各种生物，如蚯蚓、章鱼、水母等。

【伯克利博士论文】物理世界中可泛化且可扩展的机器人学习

专知会员服务

22+阅读 · 1月18日

专业软件开发者不靠“氛围编程”（Vibe Coding），而靠“控制”：2025 年 AI Agent 在编程中的应用研究

专知会员服务

22+阅读 · 2025年12月31日

深度强化学习与模仿学习导论

专知会员服务

25+阅读 · 2025年12月10日

【斯坦福大学博士论文】学习连续体机器人控制中的主要动力学

专知会员服务

16+阅读 · 2025年4月19日