揭秘面向金融大语言模型的领域自适应后训练 (Demystifying Domain-adaptive Post-training for Financial LLMs)

Domain-adaptive post-training of large language models (LLMs) has emerged as a promising approach for specialized domains such as medicine and finance. However, significant challenges remain in identifying optimal adaptation criteria and training strategies across varying data and model configurations. To address these challenges, we introduce FINDAP, a systematic and fine-grained investigation into domain-adaptive post-training of LLMs for the finance domain. Our approach consists of four key components: FinCap, which defines the core capabilities required for the target domain; FinRec, an effective training recipe that jointly optimizes continual pre-training and instruction-following, along with a novel preference data distillation method leveraging process signals from a generative reward model; FinTrain, a curated set of training datasets supporting FinRec; and FinEval, a comprehensive evaluation suite aligned with FinCap. The resulting model, Llama-Fin, achieves state-of-the-art performance across a wide range of financial tasks. Our analysis also highlights how each post-training stage contributes to distinct capabilities, uncovering specific challenges and effective solutions, providing valuable insights for domain adaptation of LLMs

翻译：大型语言模型（LLM）的领域自适应后训练已成为医学、金融等专业领域的一种前景广阔的方法。然而，在不同数据和模型配置下，如何确定最优的适应标准和训练策略仍面临重大挑战。为应对这些挑战，我们提出了FINDAP，一项针对金融领域LLM领域自适应后训练的系统性、细粒度研究。我们的方法包含四个关键组成部分：FinCap，定义了目标领域所需的核心能力；FinRec，一种联合优化持续预训练与指令跟随的有效训练方案，以及一种利用生成式奖励模型过程信号的新型偏好数据蒸馏方法；FinTrain，一套支持FinRec的精选训练数据集；以及FinEval，一个与FinCap对齐的综合性评估套件。由此得到的模型Llama-Fin，在广泛的金融任务中实现了最先进的性能。我们的分析还揭示了每个后训练阶段如何贡献于不同的能力，发现了具体的挑战和有效的解决方案，为LLM的领域适应提供了宝贵的见解。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日