Enabling A Network AI Gym for Autonomous Cyber Agents - 专知论文

会员服务 ·

0

网络仿真 · 脱机 · 网络安全 · 高保真 · AI ·

2023 年 4 月 3 日

Enabling A Network AI Gym for Autonomous Cyber Agents

翻译：为自主网络智能体构建网络人工智能训练场

Li Li,Jean-Pierre S. El Rami,Adrian Taylor,James Hailing Rao,Thomas Kunz

from arxiv, To appear in Proceedings of the 2022 International Conference on Computational Science and Computational Intelligence

This work aims to enable autonomous agents for network cyber operations (CyOps) by applying reinforcement and deep reinforcement learning (RL/DRL). The required RL training environment is particularly challenging, as it must balance the need for high-fidelity, best achieved through real network emulation, with the need for running large numbers of training episodes, best achieved using simulation. A unified training environment, namely the Cyber Gym for Intelligent Learning (CyGIL) is developed where an emulated CyGIL-E automatically generates a simulated CyGIL-S. From preliminary experimental results, CyGIL-S is capable to train agents in minutes compared with the days required in CyGIL-E. The agents trained in CyGIL-S are transferrable directly to CyGIL-E showing full decision proficiency in the emulated "real" network. Enabling offline RL, the CyGIL solution presents a promising direction towards sim-to-real for leveraging RL agents in real-world cyber networks.

翻译：本研究旨在通过强化学习与深度强化学习技术，为网络作战（CyOps）构建自主智能体。所需的强化学习训练环境极具挑战性，既要通过真实网络仿真实现高保真度，又要通过模拟器实现大量训练回合的运行。本文开发了统一训练环境——智能学习网络训练场（CyGIL），其中仿真环境CyGIL-E可自动生成模拟环境CyGIL-S。初步实验结果表明，CyGIL-S能在数分钟内完成智能体训练，而CyGIL-E则需要数天时间。经CyGIL-S训练的智能体可直接迁移至CyGIL-E，在仿真的"真实"网络中展现出完整的决策能力。通过支持离线强化学习，CyGIL方案为在真实网络环境中部署强化学习智能体提供了从仿真到现实的可行方向。

0

相关内容

网络仿真

148页最新《深度强化学习》教程，148页ppt

148页最新《深度强化学习》教程，148页ppt

专知会员服务

77+阅读 · 2023年4月29日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

24+阅读 · 2022年3月19日

系列教程GNN-algorithms之七：《图同构网络—GIN》

系列教程GNN-algorithms之七：《图同构网络—GIN》

专知会员服务

48+阅读 · 2020年8月9日

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

159+阅读 · 2020年8月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

专知会员服务

16+阅读 · 2019年11月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

使用 JAX 构建强化学习 agent，并借助 TensorFlow Lite 将其部署到 Android 应用中

使用 JAX 构建强化学习 agent，并借助 TensorFlow Lite 将其部署到 Android 应用中

谷歌开发者

0+阅读 · 2022年11月1日

CALDERA 一款对手自动模拟工具

CALDERA 一款对手自动模拟工具

黑白之道

20+阅读 · 2019年9月17日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

tensorflow Object Detection API使用预训练模型mask r-cnn实现对象检测

tensorflow Object Detection API使用预训练模型mask r-cnn实现对象检测

极市平台

12+阅读 · 2018年8月24日

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

专知

23+阅读 · 2018年2月23日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

基于重要性采样的并行离策略强化学习方法研究

国家自然科学基金

24+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

47+阅读 · 2015年12月31日

铸造高硼高速钢硼碳化物调控及其耐磨性研究

国家自然科学基金

0+阅读 · 2014年12月31日

非自由部署空间中无线传感器网络查询处理技术研究

国家自然科学基金

1+阅读 · 2013年12月31日

三维互联网应用中的服饰实时动画关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向ISM频段无线传感器网络的合作共存与优化技术

国家自然科学基金

0+阅读 · 2012年12月31日

基于用户偏好感知的SaaS服务选择优化研究

国家自然科学基金

0+阅读 · 2012年12月31日

金属茂基聚合型阻燃抑烟剂的制备及其催化交联成炭机理

国家自然科学基金

0+阅读 · 2012年12月31日

社会依赖演化网调控的Agent服务协同自适应

国家自然科学基金

0+阅读 · 2012年12月31日

基于IEEE802.11n的长距离无线mesh网络理论与关键技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

Variable Grasp Pose and Commitment for Trajectory Optimization

Arxiv

0+阅读 · 2023年5月21日

Autonomous GIS: the next-generation AI-powered GIS

Arxiv

0+阅读 · 2023年5月20日

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

Arxiv

0+阅读 · 2023年5月20日

Vision-based DRL Autonomous Driving Agent with Sim2Real Transfer

Arxiv

0+阅读 · 2023年5月19日

Dive into the Power of Neuronal Heterogeneity

Arxiv

0+阅读 · 2023年5月19日

LMEye: An Interactive Perception Network for Large Language Models

Arxiv

0+阅读 · 2023年5月19日

Collective Reasoning for Safe Autonomous Systems

Arxiv

0+阅读 · 2023年5月18日

Autonomous Drone Racing: A Survey

Arxiv

27+阅读 · 2023年1月5日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

Adaptive Synthetic Characters for Military Training

Adaptive Synthetic Characters for Military Training

Arxiv

50+阅读 · 2021年1月6日

VIP会员

文章信息

相关主题

最新内容

DARPA拟打造十万规模自主思考作战的AI智能体集群：“受控涌现式分布式人工智能”（DICE）项目

DARPA拟打造十万规模自主思考作战的AI智能体集群：“受控涌现式分布式人工智能”（DICE）项目

专知会员服务

0+阅读 · 7月17日

《边缘端实时无线感知赋能现场多机器人部署》200页

《边缘端实时无线感知赋能现场多机器人部署》200页

专知会员服务

2+阅读 · 7月17日

战力倍增器：自主武器系统与乌克兰及加沙冲突

战力倍增器：自主武器系统与乌克兰及加沙冲突

专知会员服务

1+阅读 · 7月17日

人工智能赋能战场情报：提速决策进程

人工智能赋能战场情报：提速决策进程

专知会员服务

0+阅读 · 7月17日

《拥抱新兴技术：面向未来军官的教育革新》

《拥抱新兴技术：面向未来军官的教育革新》

专知会员服务

2+阅读 · 7月17日

ACM MM 2026 | MAR-GRPO：稳定混合图像生成的强化学习训练

ACM MM 2026 | MAR-GRPO：稳定混合图像生成的强化学习训练

专知会员服务

0+阅读 · 7月17日

综述 | 大模型水印理论与部署：来源追踪、攻击鲁棒与可信治理

综述 | 大模型水印理论与部署：来源追踪、攻击鲁棒与可信治理

专知会员服务

0+阅读 · 7月17日

《火线上的后勤保障：对抗环境下的随机规划模型研究——俄乌场景案例分析》99页

《火线上的后勤保障：对抗环境下的随机规划模型研究——俄乌场景案例分析》99页

专知会员服务

11+阅读 · 7月16日

《无人地面战车（UGV）的崛起》报告

《无人地面战车（UGV）的崛起》报告

专知会员服务

7+阅读 · 7月16日

《无人机参数化与集群飞行创新项目的监控流程管理：模型、策略及自适应解决方案》

《无人机参数化与集群飞行创新项目的监控流程管理：模型、策略及自适应解决方案》

专知会员服务

6+阅读 · 7月16日

《美军开放式任务系统（OMS）定义与文档（D&D）——Java关键抽象层（CAL）接口生成规范》47页标准

《美军开放式任务系统（OMS）定义与文档（D&D）——Java关键抽象层（CAL）接口生成规范》47页标准

专知会员服务

12+阅读 · 7月16日

美陆军任务式指挥人工智能解决方案

美陆军任务式指挥人工智能解决方案

专知会员服务

11+阅读 · 7月16日

ICML 2026 | 理论级自动形式化：从孤立命题到统一形式化知识库

ICML 2026 | 理论级自动形式化：从孤立命题到统一形式化知识库

专知会员服务

9+阅读 · 7月16日

综述 | 现代智能体自我改进，从模型更新到脚手架演化

综述 | 现代智能体自我改进，从模型更新到脚手架演化

专知会员服务

15+阅读 · 7月16日

美国陆军宣布“项目融合-顶点6”：现代化进程的关键里程碑

美国陆军宣布“项目融合-顶点6”：现代化进程的关键里程碑

专知会员服务

13+阅读 · 7月15日

相关VIP内容

148页最新《深度强化学习》教程，148页ppt

148页最新《深度强化学习》教程，148页ppt

专知会员服务

77+阅读 · 2023年4月29日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

24+阅读 · 2022年3月19日

系列教程GNN-algorithms之七：《图同构网络—GIN》

系列教程GNN-algorithms之七：《图同构网络—GIN》

专知会员服务

48+阅读 · 2020年8月9日

【DeepMind】强化学习教程，83页ppt

【DeepMind】强化学习教程，83页ppt

专知会员服务

159+阅读 · 2020年8月7日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

深度强化学习策略梯度教程，53页ppt

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

【CVPR 2019 | tutorial】自主汽车的感知、预测和大规模数据采集：Perception, Prediction, and Large Scale Data Collection for Autonomous Cars

专知会员服务

33+阅读 · 2019年11月28日

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

【O'Reilly TensorFlow Conference 2019】不要打败市场；击败机器人：金融对抗网络（Don’t beat the market; beat the bots: Adversarial networks in finance），Manceps机器学习架构师Garrett Lander、首席执行官兼首席顾问Al Kari

专知会员服务

16+阅读 · 2019年11月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

热门VIP内容

开通专知VIP会员享更多权益服务

《边缘端实时无线感知赋能现场多机器人部署》200页

人工智能赋能战场情报：提速决策进程

DARPA拟打造十万规模自主思考作战的AI智能体集群：“受控涌现式分布式人工智能”（DICE）项目

战力倍增器：自主武器系统与乌克兰及加沙冲突

相关资讯

使用 JAX 构建强化学习 agent，并借助 TensorFlow Lite 将其部署到 Android 应用中

使用 JAX 构建强化学习 agent，并借助 TensorFlow Lite 将其部署到 Android 应用中

谷歌开发者

0+阅读 · 2022年11月1日

CALDERA 一款对手自动模拟工具

CALDERA 一款对手自动模拟工具

黑白之道

20+阅读 · 2019年9月17日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

tensorflow Object Detection API使用预训练模型mask r-cnn实现对象检测

tensorflow Object Detection API使用预训练模型mask r-cnn实现对象检测

极市平台

12+阅读 · 2018年8月24日

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

【论文推荐】最新六篇生成式对抗网络（GAN）相关论文—半监督学习、对偶、交互生成对抗网络、激活、纳什均衡、tempoGAN

专知

23+阅读 · 2018年2月23日

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

【论文推荐】最新5篇目标跟踪（Object Tracking）相关论文—并行跟踪和验证、光流、自动跟踪、相关滤波集成、CFNet

专知

25+阅读 · 2018年2月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Variable Grasp Pose and Commitment for Trajectory Optimization

Arxiv

0+阅读 · 2023年5月21日

Autonomous GIS: the next-generation AI-powered GIS

Arxiv

0+阅读 · 2023年5月20日

DAMO-StreamNet: Optimizing Streaming Perception in Autonomous Driving

Arxiv

0+阅读 · 2023年5月20日

Vision-based DRL Autonomous Driving Agent with Sim2Real Transfer

Arxiv

0+阅读 · 2023年5月19日

Dive into the Power of Neuronal Heterogeneity

Arxiv

0+阅读 · 2023年5月19日

LMEye: An Interactive Perception Network for Large Language Models

Arxiv

0+阅读 · 2023年5月19日

Collective Reasoning for Safe Autonomous Systems

Arxiv

0+阅读 · 2023年5月18日

Autonomous Drone Racing: A Survey

Arxiv

27+阅读 · 2023年1月5日

Imitation Learning: Progress, Taxonomies and Opportunities

Arxiv

12+阅读 · 2021年6月23日

Adaptive Synthetic Characters for Military Training

Adaptive Synthetic Characters for Military Training

Arxiv

50+阅读 · 2021年1月6日

相关基金

基于重要性采样的并行离策略强化学习方法研究

国家自然科学基金

24+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

47+阅读 · 2015年12月31日

铸造高硼高速钢硼碳化物调控及其耐磨性研究

国家自然科学基金

0+阅读 · 2014年12月31日

非自由部署空间中无线传感器网络查询处理技术研究

国家自然科学基金

1+阅读 · 2013年12月31日

三维互联网应用中的服饰实时动画关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向ISM频段无线传感器网络的合作共存与优化技术

国家自然科学基金

0+阅读 · 2012年12月31日

基于用户偏好感知的SaaS服务选择优化研究

国家自然科学基金

0+阅读 · 2012年12月31日

金属茂基聚合型阻燃抑烟剂的制备及其催化交联成炭机理

国家自然科学基金

0+阅读 · 2012年12月31日

社会依赖演化网调控的Agent服务协同自适应

国家自然科学基金

0+阅读 · 2012年12月31日

基于IEEE802.11n的长距离无线mesh网络理论与关键技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员