RSRM: Reinforcement Symbolic Regression Machine

In nature, the behaviors of many complex systems can be described by parsimonious math equations. Automatically distilling these equations from limited data is cast as a symbolic regression process which hitherto remains a grand challenge. Keen efforts in recent years have been placed on tackling this issue and demonstrated success in symbolic regression. However, there still exist bottlenecks that current methods struggle to break when the discrete search space tends toward infinity and especially when the underlying math formula is intricate. To this end, we propose a novel Reinforcement Symbolic Regression Machine (RSRM) that masters the capability of uncovering complex math equations from only scarce data. The RSRM model is composed of three key modules: (1) a Monte Carlo tree search (MCTS) agent that explores optimal math expression trees consisting of pre-defined math operators and variables, (2) a Double Q-learning block that helps reduce the feasible search space of MCTS via properly understanding the distribution of reward, and (3) a modulated sub-tree discovery block that heuristically learns and defines new math operators to improve representation ability of math expression trees. Biding of these modules yields the state-of-the-art performance of RSRM in symbolic regression as demonstrated by multiple sets of benchmark examples. The RSRM model shows clear superiority over several representative baseline models.

翻译：自然界中，许多复杂系统的行为可通过简洁的数学方程加以描述。从有限数据中自动提取这些方程的过程被称为符号回归，而这一领域至今仍是一项重大挑战。近年来，研究者在解决该问题上投入了大量努力，并在符号回归中取得了成功。然而，当离散搜索空间趋于无穷大，尤其是底层数学公式极其复杂时，现有方法仍面临难以突破的瓶颈。为此，我们提出了一种新颖的强化符号回归机器（RSRM），它能够仅从稀疏数据中掌握发现复杂数学方程的能力。RSRM模型由三个关键模块组成：（1）蒙特卡洛树搜索（MCTS）智能体，用于探索由预定义数学运算符和变量构成的最优数学表达式树；（2）双Q学习模块，通过合理理解奖励分布来帮助缩小MCTS的可行搜索空间；（3）调制子树发现模块，通过启发式学习并定义新的数学运算符，提升数学表达式树的表示能力。这三个模块的协同作用使RSRM在符号回归中展现出最先进的性能，这在多个基准测试中得到了验证。RSRM模型在多个代表性基线模型上表现出明显的优越性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

54+阅读 · 2021年1月20日

专知会员服务

39+阅读 · 2020年11月3日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning