Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Jimin Huang,Mengxi Xiao,Dong Li,Zihao Jiang,Yuzhe Yang,Yifei Zhang,Lingfei Qian,Yan Wang,Xueqing Peng,Yang Ren,Ruoyu Xiang,Zhengyu Chen,Xiao Zhang,Yueru He,Weiguang Han,Shunian Chen,Lihang Shen,Daniel Kim,Yangyang Yu,Yupeng Cao,Zhiyang Deng,Haohang Li,Duanyu Feng,Yongfu Dai,VijayaSai Somasundaram,Peng Lu,Guojun Xiong,Zhiwei Liu,Zheheng Luo,Zhiyuan Yao,Ruey-Ling Weng,Meikang Qiu,Kaleb E Smith,Honghai Yu,Yanzhao Lai,Min Peng,Jian-Yun Nie,Jordan W. Suchow,Xiao-Yang Liu,Benyou Wang,Alejandro Lopez-Lira,Qianqian Xie,Sophia Ananiadou,Junichi Tsujii

from arxiv, 33 pages, 13 figures

Financial LLMs hold promise for advancing financial tasks and domain-specific applications. However, they are limited by scarce corpora, weak multimodal capabilities, and narrow evaluations, making them less suited for real-world application. To address this, we introduce \textit{Open-FinLLMs}, the first open-source multimodal financial LLMs designed to handle diverse tasks across text, tabular, time-series, and chart data, excelling in zero-shot, few-shot, and fine-tuning settings. The suite includes FinLLaMA, pre-trained on a comprehensive 52-billion-token corpus; FinLLaMA-Instruct, fine-tuned with 573K financial instructions; and FinLLaVA, enhanced with 1.43M multimodal tuning pairs for strong cross-modal reasoning. We comprehensively evaluate Open-FinLLMs across 14 financial tasks, 30 datasets, and 4 multimodal tasks in zero-shot, few-shot, and supervised fine-tuning settings, introducing two new multimodal evaluation datasets. Our results show that Open-FinLLMs outperforms afvanced financial and general LLMs such as GPT-4, across financial NLP, decision-making, and multi-modal tasks, highlighting their potential to tackle real-world challenges. To foster innovation and collaboration across academia and industry, we release all codes (https://anonymous.4open.science/r/PIXIU2-0D70/B1D7/LICENSE) and models under OSI-approved licenses.

翻译：金融大语言模型在推进金融任务和领域特定应用方面前景广阔。然而，它们受限于稀缺的语料库、薄弱的多模态能力以及狭窄的评估范围，使其不太适合实际应用。为解决这些问题，我们推出了 \textit{Open-FinLLMs}，这是首个开源的多模态金融大语言模型，旨在处理文本、表格、时间序列和图表数据等多种任务，并在零样本、少样本和微调设置下表现优异。该套件包括：基于全面 520 亿词元语料库预训练的 FinLLaMA；通过 573K 条金融指令微调的 FinLLaMA-Instruct；以及通过 143 万对多模态调优数据增强、具备强大跨模态推理能力的 FinLLaVA。我们在零样本、少样本和有监督微调设置下，对 Open-FinLLMs 在 14 项金融任务、30 个数据集和 4 项多模态任务上进行了全面评估，并引入了两个新的多模态评估数据集。我们的结果表明，Open-FinLLMs 在金融自然语言处理、决策制定和多模态任务上均优于 GPT-4 等先进的金融及通用大语言模型，凸显了其应对现实世界挑战的潜力。为促进学术界和工业界的创新与合作，我们依据 OSI 批准的许可证发布了所有代码（https://anonymous.4open.science/r/PIXIU2-0D70/B1D7/LICENSE）和模型。