In recent years, genetic programming (GP)-based evolutionary feature construction has achieved significant success. However, a primary challenge with evolutionary feature construction is its tendency to overfit the training data, resulting in poor generalization on unseen data. In this research, we draw inspiration from PAC-Bayesian theory and propose using sharpness-aware minimization in function space to discover symbolic features that exhibit robust performance within a smooth loss landscape in the semantic space. By optimizing sharpness in conjunction with cross-validation loss, as well as designing a sharpness reduction layer, the proposed method effectively mitigates the overfitting problem of GP, especially when dealing with a limited number of instances or in the presence of label noise. Experimental results on 58 real-world regression datasets show that our approach outperforms standard GP as well as six state-of-the-art complexity measurement methods for GP in controlling overfitting. Furthermore, the ensemble version of GP with sharpness-aware minimization demonstrates superior performance compared to nine fine-tuned machine learning and symbolic regression algorithms, including XGBoost and LightGBM.
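To make the core idea concrete, the sketch below illustrates one plausible way to score a candidate GP individual by its training loss plus an estimate of sharpness in semantic (function) space: the predictions are perturbed with Gaussian noise and the average increase in loss serves as the sharpness penalty. This is an illustrative sketch under assumed details (the perturbation scale `rho`, the number of Monte Carlo samples, and the penalty weight `alpha` are hypothetical choices), not the paper's exact estimator.

```python
import numpy as np

def sharpness_estimate(predictions, y, rho=0.1, n_samples=10, rng=None):
    """Estimate semantic-space sharpness: the average increase in squared
    loss when the model's predictions are perturbed by Gaussian noise of
    scale rho. A flat loss landscape around the current semantics yields
    a small value; a sharp one yields a large value."""
    rng = np.random.default_rng(0) if rng is None else rng
    base_loss = np.mean((predictions - y) ** 2)
    perturbed = [
        np.mean((predictions + rng.normal(0.0, rho, predictions.shape) - y) ** 2)
        for _ in range(n_samples)
    ]
    return max(float(np.mean(perturbed) - base_loss), 0.0)

def sam_fitness(predictions, y, alpha=1.0):
    """Combined fitness: prediction loss plus a weighted sharpness
    penalty, so selection favors individuals that are both accurate
    and robust to semantic perturbation."""
    loss = float(np.mean((predictions - y) ** 2))
    return loss + alpha * sharpness_estimate(predictions, y)
```

With a squared-error loss, perturbing perfect predictions by noise of scale `rho` raises the expected loss by roughly `rho**2`, so the penalty is strictly positive even at zero training error, which is what lets it discriminate among equally accurate but differently robust individuals.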