Local Search, Semantics, and Genetic Programming: a Global Analysis

Geometric Semantic Genetic Programming (GSGP) is one of the most prominent Genetic Programming (GP) variants, thanks to its solid theoretical background, its excellent performance, and execution times significantly shorter than those of standard syntax-based GP. In recent years, a new mutation operator, Geometric Semantic Mutation with Local Search (GSM-LS), has been proposed. It adds a local search step to the mutation process, based on the idea that performing a linear regression during mutation allows faster convergence to good-quality solutions. While GSM-LS helps the convergence of the evolutionary search, it is prone to overfitting. It was therefore suggested to use GSM-LS only for a limited number of generations and then to switch back to standard geometric semantic mutation. A more recently defined variant of GSGP, called GSGP-reg, also includes a local search step and shares similar strengths and weaknesses with GSM-LS. Here we explore multiple ways to limit the overfitting of GSM-LS and GSGP-reg, ranging from adaptive methods that estimate the risk of overfitting at each mutation to a simple regularized regression. The results show that the choice of method matters little: provided that some technique to control overfitting is used, it is possible to consistently outperform standard GSGP on both training and unseen data. These results help practitioners better understand the role of local search in GSGP and demonstrate that simple regularization strategies are effective in controlling overfitting.
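The abstract gives no formulas, so the following is only a minimal sketch of the idea of "a linear regression during the mutation", working directly on semantics vectors (the outputs of each tree on the training cases). It assumes the linear-combination form usually given for GSM-LS in the GSGP literature, offspring = a0 + a1 * parent + a2 * (R1 - R2), with the coefficients fitted by least squares against the training targets. The `ridge` parameter illustrates one possible "simple regularized regression" and is not claimed to be the paper's exact method. The function names `gsm` and `gsm_ls` and all parameter names are hypothetical.

```python
import numpy as np


def gsm(parent_sem, r1_sem, r2_sem, ms=0.1):
    """Standard geometric semantic mutation on semantics vectors.

    parent_sem, r1_sem, r2_sem: outputs of the parent and of two random
    trees on the training cases. ms is the mutation step.
    """
    return parent_sem + ms * (r1_sem - r2_sem)


def gsm_ls(parent_sem, r1_sem, r2_sem, target, ridge=0.0):
    """Mutation with a local search step (assumed form, see lead-in).

    Fits a0 + a1 * parent + a2 * (r1 - r2) to the training targets.
    ridge > 0 adds an L2 penalty on a1 and a2 (the intercept a0 is left
    unpenalised); ridge == 0 is plain least squares.
    Returns the offspring semantics and the fitted coefficients.
    """
    X = np.column_stack([np.ones_like(parent_sem), parent_sem, r1_sem - r2_sem])
    # Ridge written as an augmented least-squares problem: three extra
    # rows pull a1 and a2 towards zero with weight sqrt(ridge).
    reg = np.sqrt(ridge) * np.diag([0.0, 1.0, 1.0])
    A = np.vstack([X, reg])
    b = np.concatenate([target, np.zeros(3)])
    coeffs, *_ = np.linalg.lstsq(A, b, rcond=None)
    return X @ coeffs, coeffs


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    n_cases = 30
    target = rng.normal(size=n_cases)
    parent = rng.normal(size=n_cases)
    # Random-tree outputs drawn from [0, 1] purely for illustration.
    r1, r2 = rng.uniform(size=n_cases), rng.uniform(size=n_cases)

    def rmse(pred):
        return float(np.sqrt(np.mean((pred - target) ** 2)))

    print("GSM         :", rmse(gsm(parent, r1, r2)))
    print("GSM-LS      :", rmse(gsm_ls(parent, r1, r2, target)[0]))
    print("GSM-LS ridge:", rmse(gsm_ls(parent, r1, r2, target, ridge=5.0)[0]))
```

On the training cases the unregularized fit can never do worse than plain GSM, because the GSM offspring is itself one point in the space the regression searches over. That guaranteed training gain is exactly where the overfitting risk comes from, and the penalty trades some of it away by shrinking the fitted coefficients.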


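Related content

Overfitting

In AI, overfitting usually means that a learned model is too complex: it performs well on the training set but poorly on the test set, that is, it generalizes badly. Overfitting is also called overlearning. It arises during parameter fitting because the training data contain sampling error, and a sufficiently complex model fits that sampling error along with the underlying signal.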
