
Hybrid systems · Hybrid · Synthesis · Algorithms · Strategy synthesis

April 14, 2023

Sampling-based Reactive Synthesis for Nondeterministic Hybrid Systems


Qi Heng Ho, Zachary N. Sunberg, Morteza Lahijanian

From arXiv. 9 pages, 9 figures. Submitted to the 62nd IEEE Conference on Decision and Control (CDC), 2023.

This paper introduces a sampling-based strategy synthesis algorithm for nondeterministic hybrid systems with complex continuous dynamics under temporal and reachability constraints. We view the evolution of the hybrid system as a two-player game, where the nondeterminism is an adversarial player whose objective is to prevent achieving temporal and reachability goals. The aim is to synthesize a winning strategy -- a reactive (robust) strategy that guarantees the satisfaction of the goals under all possible moves of the adversarial player. The approach is based on growing a (search) game-tree in the hybrid space by combining a sampling-based planning method with a novel bandit-based technique to select and improve on partial strategies. We provide conditions under which the algorithm is probabilistically complete, i.e., if a winning strategy exists, the algorithm will almost surely find it. The case studies and benchmark results show that the algorithm is general and consistently outperforms the state of the art.
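To make the notion of a winning strategy concrete, here is a minimal sketch on a small, finite, hand-built game tree. It is not the paper's algorithm: the paper grows its tree by sampling in the hybrid space and uses a bandit-based selection rule, neither of which is reproduced here. The `Node` class, the `winning_strategy` function, and the toy moves ("fast", "slow", "gust", "calm") are all hypothetical names introduced for illustration. The sketch only shows the underlying game logic: at a controller node one winning move suffices, while at an adversary node every move must lead to a winning subtree.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Node:
    """A node in a finite two-player game tree (hypothetical structure)."""
    name: str                      # assumed unique within the tree
    player: str = "controller"     # "controller" or "adversary"
    is_goal: bool = False
    children: Dict[str, "Node"] = field(default_factory=dict)


def winning_strategy(node: Node) -> Optional[Dict[str, str]]:
    """Return {controller node name: chosen move} that reaches a goal
    against every adversary move, or None if no such strategy exists."""
    if node.is_goal:
        return {}
    if not node.children:
        return None  # non-goal dead end: the controller loses here
    if node.player == "controller":
        # OR node: a single winning move is enough.
        for move, child in node.children.items():
            sub = winning_strategy(child)
            if sub is not None:
                return {node.name: move, **sub}
        return None
    # AND node: every adversary move must lead to a winning subtree.
    strategy: Dict[str, str] = {}
    for child in node.children.values():
        sub = winning_strategy(child)
        if sub is None:
            return None
        strategy.update(sub)
    return strategy


# Toy example: "fast" can be defeated by the adversary, "slow" cannot.
crash = Node("crash")
fast = Node("after_fast", player="adversary",
            children={"gust": crash, "calm": Node("goal_a", is_goal=True)})
s1 = Node("s1", children={"go": Node("goal_b", is_goal=True)})
s2 = Node("s2", children={"go": Node("goal_c", is_goal=True)})
slow = Node("after_slow", player="adversary",
            children={"gust": s1, "calm": s2})
root = Node("root", children={"fast": fast, "slow": slow})

strategy = winning_strategy(root)
print(strategy)  # {'root': 'slow', 's1': 'go', 's2': 'go'}
```

The returned dictionary is reactive in the sense used above: it prescribes a controller move for each situation the adversary can force, rather than a single fixed sequence of moves.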



