Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM

In conversational AI research, there's a noticeable trend towards developing models with a larger number of parameters, exemplified by models like ChatGPT. While these expansive models tend to generate increasingly better chat responses, they demand significant computational resources and memory. This study explores a pertinent question: Can a combination of smaller models collaboratively achieve comparable or enhanced performance relative to a singular large model? We introduce an approach termed "blending", a straightforward yet effective method of integrating multiple chat AIs. Our empirical evidence suggests that when specific smaller models are synergistically blended, they can potentially outperform or match the capabilities of much larger counterparts. For instance, integrating just three models of moderate size (6B/13B paramaeters) can rival or even surpass the performance metrics of a substantially larger model like ChatGPT (175B+ paramaters). This hypothesis is rigorously tested using A/B testing methodologies with a large user base on the Chai research platform over a span of thirty days. The findings underscore the potential of the "blending" strategy as a viable approach for enhancing chat AI efficacy without a corresponding surge in computational demands.

翻译：在对话式人工智能研究中，存在一个显著趋势：模型参数规模不断扩大，ChatGPT 等模型即为典型代表。尽管这类大规模模型往往能生成更优质的聊天响应，但其运行需要大量计算资源与内存。本研究探讨了一个关键问题：多个较小模型的协同组合能否达到或超越单个大模型的性能？我们提出一种名为“融合”（blending）的方法，这是一种整合多个聊天 AI 的简单而高效的策略。实验证据表明，当特定较小模型以协同方式融合时，其性能可能超越或媲美远大于自身的模型。例如，仅整合三个中等规模（60亿/130亿参数）的模型，即可与参数规模超1750亿的 ChatGPT 等大模型相匹敌甚至更优。该假设通过基于 Chai 研究平台的大规模用户群体（为期30天）的 A/B 测试方法进行了严格验证。研究结果凸显了“融合”策略的潜力：它能在不显著增加计算需求的前提下，有效提升聊天 AI 的性能。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/