Open-ended search for environments and adapted agents using MAP-Elites - 专知论文

会员服务 ·

0

搜索 · 生物 · 结构 · 结构化 · 无限 ·

2023 年 5 月 2 日

Open-ended search for environments and adapted agents using MAP-Elites

翻译：开放式环境与自适应智能体的MAP-Elites搜索

Emma Stensby Norstein,Kai Olav Ellefsen,Kyrre Glette

Creatures in the real world constantly encounter new and diverse challenges they have never seen before. They will often need to adapt to some of these tasks and solve them in order to survive. This almost endless world of novel challenges is not as common in virtual environments, where artificially evolving agents often have a limited set of tasks to solve. An exception to this is the field of open-endedness where the goal is to create unbounded exploration of interesting artefacts. We want to move one step closer to creating simulated environments similar to the diverse real world, where agents can both find solvable tasks, and adapt to them. Through the use of MAP-Elites we create a structured repertoire, a map, of terrains and virtual creatures that locomote through them. By using novelty as a dimension in the grid, the map can continuously develop to encourage exploration of new environments. The agents must adapt to the environments found, but can also search for environments within each cell of the grid to find the one that best fits their set of skills. Our approach combines the structure of MAP-Elites, which can allow the virtual creatures to use adjacent cells as stepping stones to solve increasingly difficult environments, with open-ended innovation. This leads to a search that is unbounded, but still has a clear structure. We find that while handcrafted bounded dimensions for the map lead to quicker exploration of a large set of environments, both the bounded and unbounded approach manage to solve a diverse set of terrains.

翻译：真实世界中的生物不断面临前所未有的新奇多样挑战。它们常常需要适应其中某些任务并加以解决以求生存。这种近乎无尽的新奇挑战世界在虚拟环境中并不常见——人工进化智能体通常只需解决有限的任务集。开放式进化领域是个例外，其目标是创造对有趣人工制品无边界探索的系统。我们旨在更进一步构建类似真实世界多样性的模拟环境，使智能体既能发现可解任务，又能适应这些任务。通过运用MAP-Elites算法，我们构建了地形与穿行其中的虚拟生物的结构化库——即"地图"。以新颖性作为网格维度，该地图可持续发展以鼓励对新环境的探索。智能体必须适应所发现的环境，同时也能在每个网格单元内搜索最适合其技能组合的环境。我们的方法将MAP-Elites的结构（允许虚拟生物利用相邻单元作为踏脚石攻克日益复杂的环境）与开放式创新相结合，实现了既有明确框架又不设限的搜索。研究发现：虽然人工设计的边界化维度能快速探索大量环境，但无论采用有界还是无界方法，都能成功解决多样化的地形挑战。

0

相关内容

互联网

【伯克利博士论文】机器人机械搜索的操作与感知策略

【伯克利博士论文】机器人机械搜索的操作与感知策略

专知会员服务

16+阅读 · 2022年6月4日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

105+阅读 · 2022年2月10日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

55+阅读 · 2021年4月11日

【干货书】Python程序员编程，810页pdf，Python® for Programmers

【干货书】Python程序员编程，810页pdf，Python® for Programmers

专知会员服务

62+阅读 · 2020年8月6日

【Facebook|AAAI2020】在合作的部分可观察博弈中通过搜索改进策略（Improving Policies via Search in Cooperative Partially Observable Games）

【Facebook|AAAI2020】在合作的部分可观察博弈中通过搜索改进策略（Improving Policies via Search in Cooperative Partially Observable Games）

专知会员服务

16+阅读 · 2019年12月10日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

TensorFlow 2.0 学习资源汇总

TensorFlow 2.0 学习资源汇总

专知会员服务

67+阅读 · 2019年10月9日

RL解决'LunarLander-v2' (SOTA)

RL解决'LunarLander-v2' (SOTA)

CreateAMind

62+阅读 · 2019年9月27日

谷歌足球游戏环境使用介绍

谷歌足球游戏环境使用介绍

CreateAMind

33+阅读 · 2019年6月27日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新5篇视觉目标跟踪相关论文—递归神经网络、深度适应计算策略、视觉目标跟踪基准、深度核化相关滤波、检测并跟踪

【论文推荐】最新5篇视觉目标跟踪相关论文—递归神经网络、深度适应计算策略、视觉目标跟踪基准、深度核化相关滤波、检测并跟踪

专知

14+阅读 · 2018年1月22日

流场环境下打结高分子链的构象及动力学行为的研究

国家自然科学基金

0+阅读 · 2015年12月31日

海量RDF数据探索式搜索关键技术与系统研究

国家自然科学基金

0+阅读 · 2014年12月31日

异构动态移动通信网络的延时优化

国家自然科学基金

2+阅读 · 2013年12月31日

P2P-Grid环境中分布式不确定本体模型的研究

国家自然科学基金

0+阅读 · 2013年12月31日

无线网络中跨周界环境区分的研究

国家自然科学基金

0+阅读 · 2012年12月31日

无人车越野环境感知关键技术研究

国家自然科学基金

1+阅读 · 2012年12月31日

大质量恒星形成环境的红外亚毫米波观测新研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于移动Agent的无线传感器网络数据处理技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

Internet环境中基于语义Web的开放式决策支持系统关键技术研究

国家自然科学基金

0+阅读 · 2010年12月31日

传感器网络下分布式多目标跟踪方法研究

国家自然科学基金

1+阅读 · 2009年12月31日

Sample-Efficient On-Policy Imitation Learning from Observations

Arxiv

0+阅读 · 2023年6月16日

Multi-Objective and Model-Predictive Tree Search for Spatiotemporal Informative Planning

Arxiv

0+阅读 · 2023年6月16日

House-Swapping with Objective Indifferences

Arxiv

0+阅读 · 2023年6月15日

Evolving Populations of Diverse RL Agents with MAP-Elites

Arxiv

0+阅读 · 2023年6月15日

Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery

Arxiv

0+阅读 · 2023年6月15日

ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations

Arxiv

0+阅读 · 2023年6月13日

Artificial Intelligence for the Metaverse: A Survey

Arxiv

31+阅读 · 2022年2月15日

Adaptive Synthetic Characters for Military Training

Adaptive Synthetic Characters for Military Training

Arxiv

50+阅读 · 2021年1月6日

Representation Learning with Ordered Relation Paths for Knowledge Graph Completion

Representation Learning with Ordered Relation Paths for Knowledge Graph Completion

Arxiv

12+阅读 · 2019年9月26日

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge Graphs

Arxiv

21+阅读 · 2018年1月16日

VIP会员

文章信息

相关主题

最新内容

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

专知会员服务

2+阅读 · 6月23日

综述 | 世界动作模型：少做梦，多行动

综述 | 世界动作模型：少做梦，多行动

专知会员服务

4+阅读 · 6月23日

美以伊冲突：无人机与人工智能的运用

美以伊冲突：无人机与人工智能的运用

专知会员服务

7+阅读 · 6月23日

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

专知会员服务

3+阅读 · 6月23日

《特种部队在透明战场中的生存力》最新报告

《特种部队在透明战场中的生存力》最新报告

专知会员服务

4+阅读 · 6月23日

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

专知会员服务

6+阅读 · 6月23日

《人工智能生成的零日漏洞：对未来作战的影响》

《人工智能生成的零日漏洞：对未来作战的影响》

专知会员服务

5+阅读 · 6月23日

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

专知会员服务

3+阅读 · 6月23日

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

6+阅读 · 6月22日

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

8+阅读 · 6月22日

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

8+阅读 · 6月22日

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

4+阅读 · 6月22日

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

6+阅读 · 6月22日

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

5+阅读 · 6月22日

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

9+阅读 · 6月22日

相关VIP内容

【伯克利博士论文】机器人机械搜索的操作与感知策略

【伯克利博士论文】机器人机械搜索的操作与感知策略

专知会员服务

16+阅读 · 2022年6月4日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

105+阅读 · 2022年2月10日

《行为与认知机器人学》，241页pdf

《行为与认知机器人学》，241页pdf

专知会员服务

55+阅读 · 2021年4月11日

【干货书】Python程序员编程，810页pdf，Python® for Programmers

【干货书】Python程序员编程，810页pdf，Python® for Programmers

专知会员服务

62+阅读 · 2020年8月6日

【Facebook|AAAI2020】在合作的部分可观察博弈中通过搜索改进策略（Improving Policies via Search in Cooperative Partially Observable Games）

【Facebook|AAAI2020】在合作的部分可观察博弈中通过搜索改进策略（Improving Policies via Search in Cooperative Partially Observable Games）

专知会员服务

16+阅读 · 2019年12月10日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

TensorFlow 2.0 学习资源汇总

TensorFlow 2.0 学习资源汇总

专知会员服务

67+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 世界动作模型：少做梦，多行动

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

美以伊冲突：无人机与人工智能的运用

相关资讯

RL解决'LunarLander-v2' (SOTA)

RL解决'LunarLander-v2' (SOTA)

CreateAMind

62+阅读 · 2019年9月27日

谷歌足球游戏环境使用介绍

谷歌足球游戏环境使用介绍

CreateAMind

33+阅读 · 2019年6月27日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新5篇视觉目标跟踪相关论文—递归神经网络、深度适应计算策略、视觉目标跟踪基准、深度核化相关滤波、检测并跟踪

【论文推荐】最新5篇视觉目标跟踪相关论文—递归神经网络、深度适应计算策略、视觉目标跟踪基准、深度核化相关滤波、检测并跟踪

专知

14+阅读 · 2018年1月22日

相关论文

Sample-Efficient On-Policy Imitation Learning from Observations

Arxiv

0+阅读 · 2023年6月16日

Multi-Objective and Model-Predictive Tree Search for Spatiotemporal Informative Planning

Arxiv

0+阅读 · 2023年6月16日

House-Swapping with Objective Indifferences

Arxiv

0+阅读 · 2023年6月15日

Evolving Populations of Diverse RL Agents with MAP-Elites

Arxiv

0+阅读 · 2023年6月15日

Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery

Arxiv

0+阅读 · 2023年6月15日

ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations

Arxiv

0+阅读 · 2023年6月13日

Artificial Intelligence for the Metaverse: A Survey

Arxiv

31+阅读 · 2022年2月15日

Adaptive Synthetic Characters for Military Training

Adaptive Synthetic Characters for Military Training

Arxiv

50+阅读 · 2021年1月6日

Representation Learning with Ordered Relation Paths for Knowledge Graph Completion

Representation Learning with Ordered Relation Paths for Knowledge Graph Completion

Arxiv

12+阅读 · 2019年9月26日

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge Graphs

Arxiv

21+阅读 · 2018年1月16日

相关基金

流场环境下打结高分子链的构象及动力学行为的研究

国家自然科学基金

0+阅读 · 2015年12月31日

海量RDF数据探索式搜索关键技术与系统研究

国家自然科学基金

0+阅读 · 2014年12月31日

异构动态移动通信网络的延时优化

国家自然科学基金

2+阅读 · 2013年12月31日

P2P-Grid环境中分布式不确定本体模型的研究

国家自然科学基金

0+阅读 · 2013年12月31日

无线网络中跨周界环境区分的研究

国家自然科学基金

0+阅读 · 2012年12月31日

无人车越野环境感知关键技术研究

国家自然科学基金

1+阅读 · 2012年12月31日

大质量恒星形成环境的红外亚毫米波观测新研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于移动Agent的无线传感器网络数据处理技术研究

国家自然科学基金

0+阅读 · 2011年12月31日

Internet环境中基于语义Web的开放式决策支持系统关键技术研究

国家自然科学基金

0+阅读 · 2010年12月31日

传感器网络下分布式多目标跟踪方法研究

国家自然科学基金

1+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员