SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning

Automated Machine Learning (AutoML) approaches encompass traditional methods that optimize fixed pipelines for model selection and ensembling, as well as newer LLM-based frameworks that autonomously build pipelines. While LLM-based agents have shown promise in automating machine learning tasks, they often generate low-diversity and suboptimal code, even after multiple iterations. To overcome these limitations, we introduce Tree-Search Enhanced LLM Agents (SELA), an innovative agent-based system that leverages Monte Carlo Tree Search (MCTS) to optimize the AutoML process. By representing pipeline configurations as trees, our framework enables agents to conduct experiments intelligently and iteratively refine their strategies, facilitating a more effective exploration of the machine learning solution space. This novel approach allows SELA to discover optimal pathways based on experimental feedback, improving the overall quality of the solutions. In an extensive evaluation across 20 machine learning datasets, we compare the performance of traditional and agent-based AutoML methods, demonstrating that SELA achieves a win rate of 65% to 80% against each baseline across all datasets. These results underscore the significant potential of agent-based strategies in AutoML, offering a fresh perspective on tackling complex machine learning challenges.

翻译：自动化机器学习（AutoML）方法既包括优化固定流程以进行模型选择与集成的传统方法，也包含能够自主构建流程的新型基于大语言模型（LLM）的框架。尽管基于LLM的智能体在自动化机器学习任务中展现出潜力，但它们即使在多次迭代后仍常生成多样性不足且次优的代码。为克服这些限制，我们提出了基于树搜索增强的大语言模型智能体（SELA），这是一个创新的基于智能体的系统，利用蒙特卡洛树搜索（MCTS）来优化AutoML流程。通过将流程配置表示为树状结构，我们的框架使智能体能够智能地进行实验并迭代优化其策略，从而更有效地探索机器学习解空间。这种新颖的方法使SELA能够基于实验反馈发现最优路径，提升解决方案的整体质量。在涵盖20个机器学习数据集的广泛评估中，我们比较了传统方法与基于智能体的AutoML方法的性能，结果表明SELA在所有数据集上相对于各基线方法均取得了65%至80%的胜率。这些结果凸显了基于智能体的策略在AutoML中的巨大潜力，为应对复杂的机器学习挑战提供了新的视角。

相关内容

Machine Learning

关注 2251

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日