SPIO：基于LLM多智能体规划的自动数据科学集成与选择策略 (SPIO: Ensemble and Selective Strategies via LLM-Based Multi-Agent Planning in Automated Data Science)

Large Language Models (LLMs) have revolutionized automated data analytics and machine learning by enabling dynamic reasoning and adaptability. While recent approaches have advanced multi-stage pipelines through multi-agent systems, they typically rely on rigid, single-path workflows that limit the exploration and integration of diverse strategies, often resulting in suboptimal predictions. To address these challenges, we propose SPIO (Sequential Plan Integration and Optimization), a novel framework that leverages LLM-driven decision-making to orchestrate multi-agent planning across four key modules: data preprocessing, feature engineering, modeling, and hyperparameter tuning. In each module, dedicated planning agents independently generate candidate strategies that cascade into subsequent stages, fostering comprehensive exploration. A plan optimization agent refines these strategies by suggesting several optimized plans. We further introduce two variants: SPIO-S, which selects a single best solution path as determined by the LLM, and SPIO-E, which selects the top k candidate plans and ensembles them to maximize predictive performance. Extensive experiments on Kaggle and OpenML datasets demonstrate that SPIO significantly outperforms state-of-the-art methods, providing a robust and scalable solution for automated data science task.

翻译：大型语言模型（LLM）通过实现动态推理与自适应能力，彻底改变了自动化数据分析和机器学习领域。尽管近期研究通过多智能体系统推进了多阶段流程的发展，但这些方法通常依赖于僵化的单路径工作流，限制了对多样化策略的探索与整合，往往导致次优预测结果。为应对这些挑战，我们提出SPIO（顺序计划集成与优化）——一种创新框架，该框架利用LLM驱动的决策机制，在四个核心模块（数据预处理、特征工程、建模及超参数调优）中协调多智能体规划。在每个模块中，专用规划智能体独立生成候选策略，这些策略将级联传递至后续阶段，从而促进全面探索。计划优化智能体通过提出若干优化方案来精炼这些策略。我们进一步引入两种变体：SPIO-S（根据LLM判定选择单一最优解路径）与SPIO-E（选取前k个候选计划进行集成以最大化预测性能）。在Kaggle和OpenML数据集上的大量实验表明，SPIO显著优于现有最先进方法，为自动化数据科学任务提供了鲁棒且可扩展的解决方案。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

31+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日