MCTS-GEB: Monte Carlo Tree Search is a Good E-graph Builder - 专知论文

会员服务 ·

0

蒙特卡罗树搜索 · 蒙特卡罗 · 构建 · 饱和 · 搜索 ·

2023 年 4 月 14 日

MCTS-GEB: Monte Carlo Tree Search is a Good E-graph Builder

翻译：MCTS-GEB：蒙特卡洛树搜索是一种优秀的E-graph构建器

Guoliang He,Zak Singh,Eiko Yoneki

Rewrite systems [6, 10, 12] have been widely employing equality saturation [9], which is an optimisation methodology that uses a saturated e-graph to represent all possible sequences of rewrite simultaneously, and then extracts the optimal one. As such, optimal results can be achieved by avoiding the phase-ordering problem. However, we observe that when the e-graph is not saturated, it cannot represent all possible rewrite opportunities and therefore the phase-ordering problem is re-introduced during the construction phase of the e-graph. To address this problem, we propose MCTS-GEB, a domain-general rewrite system that applies reinforcement learning (RL) to e-graph construction. At its core, MCTS-GEB uses a Monte Carlo Tree Search (MCTS) [3] to efficiently plan for the optimal e-graph construction, and therefore it can effectively eliminate the phase-ordering problem at the construction phase and achieve better performance within a reasonable time. Evaluation in two different domains shows MCTS-GEB can outperform the state-of-the-art rewrite systems by up to 49x, while the optimisation can generally take less than an hour, indicating MCTS-GEB is a promising building block for the future generation of rewrite systems.

翻译：重写系统[6, 10, 12]已广泛采用等式饱和[9]技术，这是一种优化方法，通过使用饱和的e-graph同时表示所有可能的重写序列，并提取最优结果。因此，通过避免阶段排序问题可实现最优结果。然而，我们观察到当e-graph未饱和时，它无法表示所有可能的重写机会，因此在e-graph构建阶段会重新引入阶段排序问题。为应对此问题，我们提出MCTS-GEB——一种将强化学习（RL）应用于e-graph构建的通用域重写系统。其核心思想是使用蒙特卡洛树搜索（MCTS）[3]高效规划最优的e-graph构建，从而有效消除构建阶段的阶段排序问题，并在合理时间内取得更优性能。在两个不同领域的评估表明，MCTS-GEB可将性能提升至现有最先进重写系统的49倍，且优化过程通常可在1小时内完成，这表明MCTS-GEB是未来生成重写系统的一种极具前景的基础组件。

0

相关内容

蒙特卡罗树搜索

蒙特卡罗树搜索

148页最新《深度强化学习》教程，148页ppt

148页最新《深度强化学习》教程，148页ppt

专知会员服务

77+阅读 · 2023年4月29日

【CMU博士论文】分布式强化学习自动驾驶，100页pdf

【CMU博士论文】分布式强化学习自动驾驶，100页pdf

专知会员服务

37+阅读 · 2023年4月17日

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

专知会员服务

41+阅读 · 2022年10月10日

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

专知会员服务

22+阅读 · 2022年3月18日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

专知

25+阅读 · 2018年4月29日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

机器学习研究会

13+阅读 · 2017年12月25日

【推荐】MXNet深度情感分析实战

【推荐】MXNet深度情感分析实战

机器学习研究会

16+阅读 · 2017年10月4日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

直线扫描内CT及其动态Bowtie研究

国家自然科学基金

0+阅读 · 2014年12月31日

溶剂热法FeSe基超导材料制备和物性研究

国家自然科学基金

0+阅读 · 2014年12月31日

地理格局及生态驱动揭示肉苁蓉品质生态型机理

国家自然科学基金

0+阅读 · 2014年12月31日

脑出血早期血肿扩大机制的CT densitometry研究

国家自然科学基金

0+阅读 · 2012年12月31日

miR-182通过MET和CTTN基因及其相关信号通路抑制肺癌转移的分子机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

构建面向Web的、以实体为中心的知识库的关键技术研究

国家自然科学基金

7+阅读 · 2012年12月31日

鸭疫里默氏杆菌整合子对其捕获和表达耐药基因盒效率的调控作用

国家自然科学基金

0+阅读 · 2012年12月31日

头颈部多排探测器螺旋CT辐射剂量的系统性优化

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

PERFOGRAPH: A Numerical Aware Program Graph Representation for Performance Optimization and Program Analysis

Arxiv

0+阅读 · 2023年5月31日

AccMER: Accelerating Multi-Agent Experience Replay with Cache Locality-aware Prioritization

Arxiv

0+阅读 · 2023年5月31日

Physics-Informed Ensemble Representation for Light-Field Image Super-Resolution

Arxiv

0+阅读 · 2023年5月31日

Learning to solve Bayesian inverse problems: An amortized variational inference approach

Arxiv

0+阅读 · 2023年5月31日

Characterizing Off-path SmartNIC for Accelerating Distributed Systems

Arxiv

0+阅读 · 2023年5月31日

Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network

Arxiv

0+阅读 · 2023年5月30日

Parameter estimation from aggregate observations: A Wasserstein distance based sequential Monte Carlo sampler

Arxiv

0+阅读 · 2023年5月30日

Voxel2Hemodynamics: An End-to-end Deep Learning Method for Predicting Coronary Artery Hemodynamics

Arxiv

0+阅读 · 2023年5月30日

Elongated Physiological Structure Segmentation via Spatial and Scale Uncertainty-aware Network

Arxiv

0+阅读 · 2023年5月30日

From Knowledge Graph Embedding to Ontology Embedding: Region Based Representations of Relational Structures

Arxiv

10+阅读 · 2018年5月26日

VIP会员

文章信息

相关主题

蒙特卡罗树搜索

最新内容

ICML 2026 Oral｜大模型为何难被提示纠正？内部先验限制标注适应性

ICML 2026 Oral｜大模型为何难被提示纠正？内部先验限制标注适应性

专知会员服务

1+阅读 · 6月8日

CVPR 2026教程：统一多模态模型走向收敛之路

CVPR 2026教程：统一多模态模型走向收敛之路

专知会员服务

2+阅读 · 6月8日

《人工智能在网络防御中的机遇》

《人工智能在网络防御中的机遇》

专知会员服务

5+阅读 · 6月8日

认知战：定义与能力发展

认知战：定义与能力发展

专知会员服务

4+阅读 · 6月8日

2026年美国防部人工智能政策如何将国防人工智能转向速度、规模与“人工智能优先”作战

2026年美国防部人工智能政策如何将国防人工智能转向速度、规模与“人工智能优先”作战

专知会员服务

6+阅读 · 6月8日

《伊朗-以色列对抗中的算法化目标选定：技术现实、法律门槛与人类控制的边界》

《伊朗-以色列对抗中的算法化目标选定：技术现实、法律门槛与人类控制的边界》

专知会员服务

4+阅读 · 6月8日

《红外图像中掩埋目标检测的深度学习方法》2026最新报告

《红外图像中掩埋目标检测的深度学习方法》2026最新报告

专知会员服务

4+阅读 · 6月8日

《小部队领导者运用新技术训练与制胜指南》2026最新50页

《小部队领导者运用新技术训练与制胜指南》2026最新50页

专知会员服务

5+阅读 · 6月8日

乌军利用美国“黄蜂”无人机摧毁俄军后勤

乌军利用美国“黄蜂”无人机摧毁俄军后勤

专知会员服务

7+阅读 · 6月7日

《支持作战级人机协同智能的交互式OODA流程》

《支持作战级人机协同智能的交互式OODA流程》

专知会员服务

15+阅读 · 6月7日

《军事地面机动的概率等时分析：未来自适应模型的多方法协同》

《军事地面机动的概率等时分析：未来自适应模型的多方法协同》

专知会员服务

7+阅读 · 6月7日

大语言模型与物联网：大语言模型与物联网融合全面综述

大语言模型与物联网：大语言模型与物联网融合全面综述

专知会员服务

12+阅读 · 6月7日

【伯克利博士论文】基于动作分块策略的强化学习

【伯克利博士论文】基于动作分块策略的强化学习

专知会员服务

7+阅读 · 6月7日

Transformer增强强化学习：通信网络基础与应用综述

Transformer增强强化学习：通信网络基础与应用综述

专知会员服务

7+阅读 · 6月7日

ICML 2026 | SARDI：扩散语言模型的自增强检索

ICML 2026 | SARDI：扩散语言模型的自增强检索

专知会员服务

8+阅读 · 6月6日

相关VIP内容

148页最新《深度强化学习》教程，148页ppt

148页最新《深度强化学习》教程，148页ppt

专知会员服务

77+阅读 · 2023年4月29日

【CMU博士论文】分布式强化学习自动驾驶，100页pdf

【CMU博士论文】分布式强化学习自动驾驶，100页pdf

专知会员服务

37+阅读 · 2023年4月17日

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

手册《兵棋推演：工具、技术和程序》33页slides，Connections UK – Wargaming for Professionals

专知会员服务

41+阅读 · 2022年10月10日

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

【Hugging Face】指导文本生成与约束波束搜索🤗Transformers，Guiding Text Generation with Constrained Beam Search in 🤗 Transformers

专知会员服务

22+阅读 · 2022年3月18日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

CVPR 2026教程：统一多模态模型走向收敛之路

认知战：定义与能力发展

ICML 2026 Oral｜大模型为何难被提示纠正？内部先验限制标注适应性

《人工智能在网络防御中的机遇》

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

【论文推荐】最新七篇强化学习相关论文—逻辑约束、综述、多任务深度强化学习、参数服务器、事件抽取、分层强化学习、过拟合研究

专知

25+阅读 · 2018年4月29日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

【推荐】(Python)多种模型(Naive Bayes, SVM, CNN, LSTM, etc)实现推文情感分析

机器学习研究会

13+阅读 · 2017年12月25日

【推荐】MXNet深度情感分析实战

【推荐】MXNet深度情感分析实战

机器学习研究会

16+阅读 · 2017年10月4日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

PERFOGRAPH: A Numerical Aware Program Graph Representation for Performance Optimization and Program Analysis

Arxiv

0+阅读 · 2023年5月31日

AccMER: Accelerating Multi-Agent Experience Replay with Cache Locality-aware Prioritization

Arxiv

0+阅读 · 2023年5月31日

Physics-Informed Ensemble Representation for Light-Field Image Super-Resolution

Arxiv

0+阅读 · 2023年5月31日

Learning to solve Bayesian inverse problems: An amortized variational inference approach

Arxiv

0+阅读 · 2023年5月31日

Characterizing Off-path SmartNIC for Accelerating Distributed Systems

Arxiv

0+阅读 · 2023年5月31日

Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network

Arxiv

0+阅读 · 2023年5月30日

Parameter estimation from aggregate observations: A Wasserstein distance based sequential Monte Carlo sampler

Arxiv

0+阅读 · 2023年5月30日

Voxel2Hemodynamics: An End-to-end Deep Learning Method for Predicting Coronary Artery Hemodynamics

Arxiv

0+阅读 · 2023年5月30日

Elongated Physiological Structure Segmentation via Spatial and Scale Uncertainty-aware Network

Arxiv

0+阅读 · 2023年5月30日

From Knowledge Graph Embedding to Ontology Embedding: Region Based Representations of Relational Structures

Arxiv

10+阅读 · 2018年5月26日

相关基金

MARVELD1基因调控肝细胞癌介入治疗的机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

直线扫描内CT及其动态Bowtie研究

国家自然科学基金

0+阅读 · 2014年12月31日

溶剂热法FeSe基超导材料制备和物性研究

国家自然科学基金

0+阅读 · 2014年12月31日

地理格局及生态驱动揭示肉苁蓉品质生态型机理

国家自然科学基金

0+阅读 · 2014年12月31日

脑出血早期血肿扩大机制的CT densitometry研究

国家自然科学基金

0+阅读 · 2012年12月31日

miR-182通过MET和CTTN基因及其相关信号通路抑制肺癌转移的分子机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

构建面向Web的、以实体为中心的知识库的关键技术研究

国家自然科学基金

7+阅读 · 2012年12月31日

鸭疫里默氏杆菌整合子对其捕获和表达耐药基因盒效率的调控作用

国家自然科学基金

0+阅读 · 2012年12月31日

头颈部多排探测器螺旋CT辐射剂量的系统性优化

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员