We present FiMI (Finance Model for India), a domain-specialized financial language model developed by National Payments Corporation of India (NPCI) for Indian digital payment systems. We develop two model variants: FiMI Base and FiMI Instruct. FiMI adapts the Mistral Small 24B architecture through a multi-stage training pipeline, beginning with continuous pre-training on 68 Billion tokens of curated financial, multilingual (English, Hindi, Hinglish), and synthetic data. This is followed by instruction fine-tuning and domain-specific supervised fine-tuning focused on multi-turn, tool-driven conversations that model real-world workflows, such as transaction disputes and mandate lifecycle management. Evaluations reveal that FiMI Base achieves a 20\% improvement over the Mistral Small 24B Base model on finance reasoning benchmark, while FiMI Instruct outperforms the Mistral Small 24B Instruct model by 87\% on domain-specific tool-calling. Moreover, FiMI achieves these significant domain gains while maintaining comparable performance to models of similar size on general benchmarks.
翻译:本文介绍FiMI(印度金融模型),这是由印度国家支付公司(NPCI)为印度数字支付系统开发的领域专用金融语言模型。我们开发了两种模型变体:FiMI Base 和 FiMI Instruct。FiMI 基于 Mistral Small 24B 架构,通过多阶段训练流程进行适配:首先使用680亿个经过筛选的金融、多语言(英语、印地语、印英混合语)及合成数据令牌进行持续预训练;随后进行指令微调以及专注于多轮次、工具驱动的对话的领域特定监督微调,此类对话模拟了真实世界的工作流程,例如交易争议和授权生命周期管理。评估结果表明,在金融推理基准测试中,FiMI Base 相比 Mistral Small 24B Base 模型实现了20%的性能提升;而在领域特定的工具调用任务上,FiMI Instruct 的性能比 Mistral Small 24B Instruct 模型高出87%。此外,FiMI 在取得这些显著领域性能提升的同时,在通用基准测试上仍保持了与同规模模型相当的性能。