如何训练你的顾问模型：利用顾问模型引导黑盒大语言模型 (How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models)

Frontier language models are deployed as black-box services, where model weights cannot be modified and customization is limited to prompting. We introduce Advisor Models, a method to train small open-weight models to generate dynamic, per-instance natural language advice that improves the capabilities of black-box frontier models. Advisor Models improve GPT-5's performance on RuleArena (Taxes) by 71%, reduce Gemini 3 Pro's steps taken in SWE agent tasks by 24.6%, and outperform static prompt optimizers in personalizing GPT-5 to user preferences (85-100% vs. 40-60%). We also find that advisors are transferable: an advisor trained with a low-cost student model still transfers improvements to a frontier model. Moreover, Advisor Models are robust: we observe no degradation on other benchmarks than the pipeline is trained on. Our method shows how to perform parametric optimization for black-box frontier models in a practical and cost-effective way.

翻译：前沿语言模型通常以黑盒服务形式部署，其模型权重无法修改，定制化仅限于提示工程。我们提出顾问模型方法，通过训练小型开源权重模型来生成动态的、针对每个实例的自然语言建议，从而提升黑盒前沿模型的能力。顾问模型将GPT-5在RuleArena（税收）任务上的性能提升71%，使Gemini 3 Pro在SWE智能体任务中的步骤数减少24.6%，并在个性化GPT-5适应用户偏好方面超越静态提示优化器（85-100% vs. 40-60%）。我们还发现顾问模型具有可迁移性：使用低成本学生模型训练的顾问模型仍能将改进迁移至前沿模型。此外，顾问模型具有鲁棒性：在训练流程未涉及的其他基准测试中未观察到性能下降。本方法展示了如何以实用且经济高效的方式对黑盒前沿模型进行参数化优化。

相关内容

黑盒

关注 1

在科学，计算和工程学中，黑盒是一种设备，系统或对象，可以根据其输入和输出（或传输特性）对其进行查看，而无需对其内部工作有任何了解。它的实现是“不透明的”（黑色）。几乎任何事物都可以被称为黑盒：晶体管，引擎，算法，人脑，机构或政府。为了使用典型的“黑匣子方法”来分析建模为开放系统的事物，仅考虑刺激/响应的行为，以推断（未知）盒子。该黑匣子系统的通常表示形式是在该方框中居中的数据流程图。黑盒的对立面是一个内部组件或逻辑可用于检查的系统，通常将其称为白盒（有时也称为“透明盒”或“玻璃盒”）。

【新书】大语言模型如何工作？200页pdf

专知会员服务

60+阅读 · 2025年6月20日

如何将领域知识注入大模型？最新《将领域特定知识注入大语言模型》综述

专知会员服务

79+阅读 · 2025年2月24日

Llama-3-SynE：实现有效且高效的大语言模型持续预训练

专知会员服务

36+阅读 · 2024年7月30日

掌握使用Python的大型语言模型

专知会员服务

63+阅读 · 2024年5月22日