SCOPE: Cost-Efficient Model Selection for Compound AI Systems under Quality Constraints - 专知论文

会员服务 ·

0

代价 · AI · MoDELS · 约束 · 阈值 ·

SCOPE: Cost-Efficient Model Selection for Compound AI Systems under Quality Constraints

翻译：暂无翻译

Yiqian Huang,Shiqi Zhang,Tianyuan Jin,Xiaokui Xiao

from arxiv, Technical report for the paper accepted at KDD 2026

A compound AI system consists of multiple LLM modules, together handling complex and multi-step tasks that exceed the capabilities of a single model. Existing systems often use a single expensive LLM across all modules to improve the result quality of the whole system. However, this configuration incurs prohibitive costs, particularly for data management and analytics tasks at scale, such as data manipulation. To this end, we formalize the problem of constrained LLM selection for compound AI systems, leveraging the diverse pricing and capabilities of different LLMs to achieve competitive quality at lower cost. Given a query dataset and a user-specified quality threshold, we aim to select an LLM for each module to minimize the system's average cost while ensuring that overall quality meets the required threshold. To solve this problem, we propose SCOPE, a cost-efficient optimization algorithm. Unlike existing approaches that rely on expensive dataset-level evaluations, SCOPE exploits per-query results to rapidly estimate the system's cost and quality, and constructs confidence bounds to guide the search for promising LLM combinations. Furthermore, SCOPE provides theoretical guarantees for meeting the quality threshold and achieving near-optimal average cost. We evaluate SCOPE against 7 baselines on three data processing tasks, demonstrating that it outperforms all baselines. Under the same search budget and quality constraint, it finds solutions with up to $20\times$ lower cost than the best competitor during the search and achieves up to $6\times$ lower final cost in the returned solution.

翻译：暂无翻译

0

相关内容

AgentOps综述：智能体系统运维框架

AgentOps综述：智能体系统运维框架

专知会员服务

19+阅读 · 6月4日

【ICML2026】MASPO：面向基于大语言模型的多智能体系统的联合提示词优化

【ICML2026】MASPO：面向基于大语言模型的多智能体系统的联合提示词优化

专知会员服务

12+阅读 · 5月9日

构建面向终端的 AI 编程智能体：脚手架、测试环境、上下文工程及实践经验

构建面向终端的 AI 编程智能体：脚手架、测试环境、上下文工程及实践经验

专知会员服务

25+阅读 · 3月8日

《多智能体大语言模型系统的可靠决策研究》

《多智能体大语言模型系统的可靠决策研究》

专知会员服务

41+阅读 · 2月2日

AI 智能体系统：体系架构、应用场景及评估范式

AI 智能体系统：体系架构、应用场景及评估范式

专知会员服务

70+阅读 · 1月6日

迈向智能体系统规模化的科学

迈向智能体系统规模化的科学

专知会员服务

22+阅读 · 2025年12月12日

【AAAI2026】AutoTool：面向大语言模型智能体的高效工具选择方法

【AAAI2026】AutoTool：面向大语言模型智能体的高效工具选择方法

专知会员服务

19+阅读 · 2025年11月19日

面向应用的智能体 AI 系统价值对齐：综述与展望

面向应用的智能体 AI 系统价值对齐：综述与展望

专知会员服务

27+阅读 · 2025年6月12日

中文版 | 集中式与分布式多智能体AI协调策略

中文版 | 集中式与分布式多智能体AI协调策略

专知会员服务

22+阅读 · 2025年5月8日

《生成式人工智能（AI）在系统工程设计中的未来考虑》25页slides

《生成式人工智能（AI）在系统工程设计中的未来考虑》25页slides

专知会员服务

39+阅读 · 2025年1月16日

【ChatGPT系列报告】人工智能行业专题报告：多模态AI研究框架，17页ppt

【ChatGPT系列报告】人工智能行业专题报告：多模态AI研究框架，17页ppt

专知

23+阅读 · 2023年4月8日

AAAI 2020 | 中科大：智能教育系统中的神经认知诊断，从数据中学习交互函数

AAAI 2020 | 中科大：智能教育系统中的神经认知诊断，从数据中学习交互函数

AI科技评论

24+阅读 · 2020年1月11日

面向人工智能的计算机体系结构

面向人工智能的计算机体系结构

计算机研究与发展

14+阅读 · 2019年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

【人工智能】智能计算概述、神经网络计算、机器学习计算、遗传算法、模糊计算、群智能计算

【人工智能】智能计算概述、神经网络计算、机器学习计算、遗传算法、模糊计算、群智能计算

产业智能官

15+阅读 · 2019年1月8日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

AI综述专栏|跨领域推荐系统文献综述（下）

AI综述专栏|跨领域推荐系统文献综述（下）

人工智能前沿讲习班

14+阅读 · 2018年5月18日

AI综述专栏 | 跨领域推荐系统文献综述（上）

AI综述专栏 | 跨领域推荐系统文献综述（上）

人工智能前沿讲习班

13+阅读 · 2018年5月16日

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

KingsGarden

13+阅读 · 2017年7月16日

基于格型结构与CS理论的高效数字系统设计与实现研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向数万处理器的有限元线性方程组与模态多级算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

输入约束下的多智能体系统完全分布式协调控制研究

国家自然科学基金

5+阅读 · 2015年12月31日

具有动态不确定性的下三角多智能体系统分布式自适应协同控制

国家自然科学基金

3+阅读 · 2015年12月31日

基于动态增益非线性干扰观测器的多智能体系统协调跟踪和干扰抑制

国家自然科学基金

1+阅读 · 2015年12月31日

带有输入饱和的多智能体系统的包含控制研究

国家自然科学基金

1+阅读 · 2015年12月31日

多智能体系统有限时间一致性的自适应控制研究

国家自然科学基金

3+阅读 · 2015年12月31日

带有通信量化和延时的多智能体系统一致性研究

国家自然科学基金

0+阅读 · 2014年12月31日

多智能体系统的可控性与群可控性研究

国家自然科学基金

10+阅读 · 2013年12月31日

基于动态分层与自学习的多智能体自适应协作模型

国家自然科学基金

17+阅读 · 2008年12月31日

Explainable AI in Speaker Recognition -- Attention Map Visualisation and Evaluation

Arxiv

0+阅读 · 6月22日

Using predictive multiplicity to measure individual performance within the AI Act

Arxiv

0+阅读 · 6月21日

Load Testing for Machine Learning Model Serving Systems at Scale

Arxiv

0+阅读 · 6月20日

Learning Burst-Aware Early Warning Models for Capacity Stress under AI Workload Surges in Hyperscale Data Centers

Arxiv

0+阅读 · 6月19日

Analyzing Defensive Misdirection Against Model-Guided Automated Attacks on Agentic AI Systems

Analyzing Defensive Misdirection Against Model-Guided Automated Attacks on Agentic AI Systems

Arxiv

0+阅读 · 6月18日

Open Weight AI Models Require Proportional Evaluation Approaches

Arxiv

0+阅读 · 6月18日

A Clinician-Centered Pipeline for Annotation and Evaluation in Ultrasound AI Studies

Arxiv

0+阅读 · 6月17日

Skill-MAS: Evolving Meta-Skill for Automatic Multi-Agent Systems

Arxiv

0+阅读 · 6月17日

AI Sandboxes: A Threat Model, Taxonomy, and Measurement Framework

Arxiv

0+阅读 · 6月16日

On the Reliability of Networks of AI Agents: Density Evolution, Stopping Sets, and Architecture Optimization

Arxiv

0+阅读 · 6月16日

VIP会员

文章信息

相关主题

最新内容

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

专知会员服务

1+阅读 · 今天14:45

综述 | 世界动作模型：少做梦，多行动

综述 | 世界动作模型：少做梦，多行动

专知会员服务

1+阅读 · 今天14:43

美以伊冲突：无人机与人工智能的运用

美以伊冲突：无人机与人工智能的运用

专知会员服务

3+阅读 · 今天14:31

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

专知会员服务

3+阅读 · 今天14:20

《特种部队在透明战场中的生存力》最新报告

《特种部队在透明战场中的生存力》最新报告

专知会员服务

2+阅读 · 今天14:11

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

专知会员服务

3+阅读 · 今天14:07

《人工智能生成的零日漏洞：对未来作战的影响》

《人工智能生成的零日漏洞：对未来作战的影响》

专知会员服务

3+阅读 · 今天14:03

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

专知会员服务

2+阅读 · 今天13:59

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

5+阅读 · 6月22日

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

8+阅读 · 6月22日

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

7+阅读 · 6月22日

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

4+阅读 · 6月22日

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

5+阅读 · 6月22日

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

5+阅读 · 6月22日

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

8+阅读 · 6月22日

相关VIP内容

AgentOps综述：智能体系统运维框架

AgentOps综述：智能体系统运维框架

专知会员服务

19+阅读 · 6月4日

【ICML2026】MASPO：面向基于大语言模型的多智能体系统的联合提示词优化

【ICML2026】MASPO：面向基于大语言模型的多智能体系统的联合提示词优化

专知会员服务

12+阅读 · 5月9日

构建面向终端的 AI 编程智能体：脚手架、测试环境、上下文工程及实践经验

构建面向终端的 AI 编程智能体：脚手架、测试环境、上下文工程及实践经验

专知会员服务

25+阅读 · 3月8日

《多智能体大语言模型系统的可靠决策研究》

《多智能体大语言模型系统的可靠决策研究》

专知会员服务

41+阅读 · 2月2日

AI 智能体系统：体系架构、应用场景及评估范式

AI 智能体系统：体系架构、应用场景及评估范式

专知会员服务

70+阅读 · 1月6日

迈向智能体系统规模化的科学

迈向智能体系统规模化的科学

专知会员服务

22+阅读 · 2025年12月12日

【AAAI2026】AutoTool：面向大语言模型智能体的高效工具选择方法

【AAAI2026】AutoTool：面向大语言模型智能体的高效工具选择方法

专知会员服务

19+阅读 · 2025年11月19日

面向应用的智能体 AI 系统价值对齐：综述与展望

面向应用的智能体 AI 系统价值对齐：综述与展望

专知会员服务

27+阅读 · 2025年6月12日

中文版 | 集中式与分布式多智能体AI协调策略

中文版 | 集中式与分布式多智能体AI协调策略

专知会员服务

22+阅读 · 2025年5月8日

《生成式人工智能（AI）在系统工程设计中的未来考虑》25页slides

《生成式人工智能（AI）在系统工程设计中的未来考虑》25页slides

专知会员服务

39+阅读 · 2025年1月16日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 世界动作模型：少做梦，多行动

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

美以伊冲突：无人机与人工智能的运用

相关资讯

【ChatGPT系列报告】人工智能行业专题报告：多模态AI研究框架，17页ppt

【ChatGPT系列报告】人工智能行业专题报告：多模态AI研究框架，17页ppt

专知

23+阅读 · 2023年4月8日

AAAI 2020 | 中科大：智能教育系统中的神经认知诊断，从数据中学习交互函数

AAAI 2020 | 中科大：智能教育系统中的神经认知诊断，从数据中学习交互函数

AI科技评论

24+阅读 · 2020年1月11日

面向人工智能的计算机体系结构

面向人工智能的计算机体系结构

计算机研究与发展

14+阅读 · 2019年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

【人工智能】智能计算概述、神经网络计算、机器学习计算、遗传算法、模糊计算、群智能计算

【人工智能】智能计算概述、神经网络计算、机器学习计算、遗传算法、模糊计算、群智能计算

产业智能官

15+阅读 · 2019年1月8日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

AI综述专栏|跨领域推荐系统文献综述（下）

AI综述专栏|跨领域推荐系统文献综述（下）

人工智能前沿讲习班

14+阅读 · 2018年5月18日

AI综述专栏 | 跨领域推荐系统文献综述（上）

AI综述专栏 | 跨领域推荐系统文献综述（上）

人工智能前沿讲习班

13+阅读 · 2018年5月16日

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

KingsGarden

13+阅读 · 2017年7月16日

相关论文

Explainable AI in Speaker Recognition -- Attention Map Visualisation and Evaluation

Arxiv

0+阅读 · 6月22日

Using predictive multiplicity to measure individual performance within the AI Act

Arxiv

0+阅读 · 6月21日

Load Testing for Machine Learning Model Serving Systems at Scale

Arxiv

0+阅读 · 6月20日

Learning Burst-Aware Early Warning Models for Capacity Stress under AI Workload Surges in Hyperscale Data Centers

Arxiv

0+阅读 · 6月19日

Analyzing Defensive Misdirection Against Model-Guided Automated Attacks on Agentic AI Systems

Analyzing Defensive Misdirection Against Model-Guided Automated Attacks on Agentic AI Systems

Arxiv

0+阅读 · 6月18日

Open Weight AI Models Require Proportional Evaluation Approaches

Arxiv

0+阅读 · 6月18日

A Clinician-Centered Pipeline for Annotation and Evaluation in Ultrasound AI Studies

Arxiv

0+阅读 · 6月17日

Skill-MAS: Evolving Meta-Skill for Automatic Multi-Agent Systems

Arxiv

0+阅读 · 6月17日

AI Sandboxes: A Threat Model, Taxonomy, and Measurement Framework

Arxiv

0+阅读 · 6月16日

On the Reliability of Networks of AI Agents: Density Evolution, Stopping Sets, and Architecture Optimization

Arxiv

0+阅读 · 6月16日

相关基金

基于格型结构与CS理论的高效数字系统设计与实现研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向数万处理器的有限元线性方程组与模态多级算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

输入约束下的多智能体系统完全分布式协调控制研究

国家自然科学基金

5+阅读 · 2015年12月31日

具有动态不确定性的下三角多智能体系统分布式自适应协同控制

国家自然科学基金

3+阅读 · 2015年12月31日

基于动态增益非线性干扰观测器的多智能体系统协调跟踪和干扰抑制

国家自然科学基金

1+阅读 · 2015年12月31日

带有输入饱和的多智能体系统的包含控制研究

国家自然科学基金

1+阅读 · 2015年12月31日

多智能体系统有限时间一致性的自适应控制研究

国家自然科学基金

3+阅读 · 2015年12月31日

带有通信量化和延时的多智能体系统一致性研究

国家自然科学基金

0+阅读 · 2014年12月31日

多智能体系统的可控性与群可控性研究

国家自然科学基金

10+阅读 · 2013年12月31日

基于动态分层与自学习的多智能体自适应协作模型

国家自然科学基金

17+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员