Reinforcement Twinning for Hybrid Control of Flapping-Wing Drones

Romain Poletti, Lorenzo Schena, Lilla Koloszar, Joris Degroote, Miguel Alfonso Mendez

Controlling flapping-wing drones requires controllers that handle time-varying, nonlinear, underactuated dynamics from incomplete, noisy sensor data. Recent advances in artificial intelligence (AI), particularly reinforcement learning (RL), have opened new perspectives for addressing such complex control problems through data-driven policy optimization from interaction with the environment. Yet purely data-driven methods are sample-inefficient, demanding extensive, sometimes unsafe exploration, especially without guiding physical models. This motivates hybrid AI-physics frameworks. This article proposes a hybrid model-free/model-based flight-control approach using the reinforcement twinning algorithm. The model-based (MB) component uses an adjoint formulation and an adaptive digital twin continuously identified from live trajectories; the model-free (MF) component uses RL. The two agents share knowledge via transfer learning, imitation learning, and shared experience between the real environment and the digital twin, coordinated by a policy referee that selects which agent acts in reality based on digital-twin performance and a real-to-virtual consistency ratio. The framework is evaluated for the longitudinal control of a flapping-wing drone, modelled as a nonlinear time-varying system driven by quasi-steady aerodynamic forces. The hybrid strategy is tested under three adaptive-model initializations: (1) offline identification from existing data, (2) random initialization with fully online identification, and (3) offline pre-training with biased parameters followed by online adaptation. In all cases, the hybrid framework improves performance, robustness, and sample efficiency over purely model-free and purely model-based approaches.
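
The policy referee described above can be illustrated with a minimal sketch. The abstract does not give the selection rule, so everything here is an assumption made for illustration: the names `RefereeInput`, `consistency_ratio`, and `select_agent`, the definition of the ratio as real cost divided by twin-predicted cost, the tolerance of 0.2, and the fallback to the model-free agent when the twin is not trusted. Costs are assumed positive, with lower being better. It is not the authors' implementation.

```python
from dataclasses import dataclass


@dataclass
class RefereeInput:
    """Hypothetical quantities a referee might inspect (all assumed)."""
    twin_cost_mb: float    # cost of the model-based policy on the digital twin
    twin_cost_mf: float    # cost of the model-free policy on the digital twin
    real_cost_last: float  # cost measured in reality for the last deployed policy
    twin_cost_last: float  # cost the twin predicted for that same policy


def consistency_ratio(real_cost: float, twin_cost: float, eps: float = 1e-12) -> float:
    """Assumed definition: real cost over twin-predicted cost (1.0 = perfect match)."""
    return real_cost / max(twin_cost, eps)


def select_agent(x: RefereeInput, tolerance: float = 0.2) -> str:
    """Return "MB" or "MF" for the agent that acts in the real environment.

    Assumed rule: trust the twin only if the consistency ratio is within
    `tolerance` of 1. If trusted, pick the agent with the lower twin cost;
    otherwise fall back to the model-free agent, which does not rely on the twin.
    """
    ratio = consistency_ratio(x.real_cost_last, x.twin_cost_last)
    twin_trusted = abs(ratio - 1.0) <= tolerance
    if not twin_trusted:
        return "MF"
    return "MB" if x.twin_cost_mb <= x.twin_cost_mf else "MF"


if __name__ == "__main__":
    # Twin agrees with reality and ranks the model-based policy better.
    print(select_agent(RefereeInput(1.0, 2.0, 1.05, 1.0)))
    # Twin badly underestimates the real cost, so it is not trusted.
    print(select_agent(RefereeInput(1.0, 2.0, 3.0, 1.0)))
```

In this sketch the consistency check acts as a gate: twin-based rankings are used only while the twin's predictions stay close to what was measured in reality, which mirrors the role the abstract assigns to the real-to-virtual consistency ratio without claiming to reproduce its exact form.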
