Foundation Models (FMs) such as GPT-4 encoded with vast knowledge and powerful emergent abilities have achieved remarkable success in various natural language processing and computer vision tasks. Grounding FMs by adapting them to domain-specific tasks or augmenting them with domain-specific knowledge enables us to exploit the full potential of FMs. However, grounding FMs faces several challenges, stemming primarily from constrained computing resources, data privacy, model heterogeneity, and model ownership. Federated Transfer Learning (FTL), the combination of federated learning and transfer learning, provides promising solutions to address these challenges. In recent years, the need for grounding FMs leveraging FTL, coined FTL-FM, has arisen strongly in both academia and industry. Motivated by the strong growth in FTL-FM research and the potential impact of FTL-FM on industrial applications, we propose an FTL-FM framework that formulates problems of grounding FMs in the federated learning setting, construct a detailed taxonomy based on the FTL-FM framework to categorize state-of-the-art FTL-FM works, and comprehensively overview FTL-FM works based on the proposed taxonomy. We also establish correspondences between FTL-FM and conventional phases of adapting FM so that FM practitioners can align their research works with FTL-FM. In addition, we overview advanced efficiency-improving and privacy-preserving techniques because efficiency and privacy are critical concerns in FTL-FM. Last, we discuss opportunities and future research directions of FTL-FM.
翻译:基础模型(如GPT-4)凭借其海量知识编码与强大的涌现能力,已在自然语言处理和计算机视觉任务中取得显著成功。通过适配领域特定任务或增强领域知识来"接地"基础模型,能够释放其全部潜力。然而,基础模型接地面临着由计算资源限制、数据隐私、模型异构性及模型所有权引发的多重挑战。作为联邦学习与迁移学习的融合,联邦迁移学习(FTL)为解决这些挑战提供了可行方案。近年来,学术界与工业界对利用FTL实现基础模型接地(简称FTL-FM)的需求日益迫切。基于FTL-FM研究的蓬勃发展与其对工业应用的潜在影响,我们提出一种FTL-FM框架:该框架在联邦学习场景中形式化基础模型接地问题,构建基于FTL-FM框架的细粒度分类体系以归类现有前沿FTL-FM研究,并基于所提分类体系进行系统性综述。通过建立FTL-FM与传统基础模型适配阶段间的对应关系,使基础模型从业者能够将自身研究与FTL-FM对齐。此外,鉴于效率与隐私是FTL-FM的关键考量,我们综述了先进的效率提升与隐私保护技术。最后,探讨了FTL-FM的机遇与未来研究方向。