Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

Large models represent a groundbreaking advancement in multiple application fields, enabling remarkable achievements across various tasks. However, their unprecedented scale comes with significant computational costs. These models, often consisting of billions of parameters, require vast amounts of computational resources for execution. Especially, the expansive scale and computational demands pose considerable challenges when customizing them for particular downstream tasks, particularly over the hardware platforms constrained by computational capabilities. Parameter Efficient Fine-Tuning (PEFT) provides a practical solution by efficiently adapt the large models over the various downstream tasks. In particular, PEFT refers to the process of adjusting the parameters of a pre-trained large models to adapt it to a specific task while minimizing the number of additional parameters introduced or computational resources required. This approach is particularly important when dealing with large language models with high parameter counts, as fine-tuning these models from scratch can be computationally expensive and resource-intensive, posing considerable challenges in the supporting system platform design. In this survey, we present comprehensive studies of various PEFT algorithms, examining their performance and computational overhead. Moreover, we provide an overview of applications developed using different PEFT algorithms and discuss common techniques employed to mitigate computation costs for PEFT. In addition to the algorithmic perspective, we overview various real-world system designs to investigate the implementation costs associated with different PEFT algorithms. This survey serves as an indispensable resource for researchers aiming to understand both the PEFT algorithm and its system implementation, offering detailed insights into recent advancements and practical applications.

翻译：大模型代表了多个应用领域的突破性进展，在各种任务中取得了显著成就。然而，其前所未有的规模带来了高昂的计算成本。这些模型通常包含数十亿参数，需要海量计算资源才能运行。特别是，当针对特定下游任务进行定制时，尤其是在计算能力受限的硬件平台上，其庞大的规模和计算需求带来了巨大挑战。参数高效微调（PEFT）提供了一种实用解决方案，通过高效地使大模型适应各种下游任务。具体而言，PEFT指的是调整预训练大模型参数以使其适应特定任务的过程，同时最小化引入的额外参数数量或所需计算资源。在处理具有高参数数量的大语言模型时，这种方法尤为重要，因为从头微调这些模型在计算上可能非常昂贵且资源密集，对支撑系统平台设计构成重大挑战。在本综述中，我们全面研究了多种PEFT算法，考察了它们的性能和计算开销。此外，我们概述了使用不同PEFT算法开发的应用，并讨论了常用于降低PEFT计算成本的技术。除了算法视角，我们还概述了多种实际系统设计，以研究不同PEFT算法相关的实现成本。本综述旨在成为研究人员理解PEFT算法及其系统实现不可或缺的资源，提供关于最新进展和实际应用的详细见解。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/