With the exponential growth in large language models (LLMs), leveraging their emergent properties for specialized domains like finance merits exploration. However, regulated fields such as finance pose unique constraints, requiring domain-optimized frameworks. We present ConFIRM, an LLM-based conversational financial information retrieval model tailored for query intent classification and knowledge base labeling. ConFIRM comprises two modules: 1) a method to synthesize finance domain-specific question-answer pairs, and 2) evaluation of parameter efficient fine-tuning approaches for the query classification task. We generate a dataset of over 4000 samples, assessing accuracy on a separate test set. ConFIRM achieved over 90% accuracy, essential for regulatory compliance. ConFIRM provides a data-efficient solution to extract precise query intent for financial dialog systems.
翻译:随着大型语言模型(LLMs)的指数级增长,利用其涌现特性服务于金融等专业领域值得深入探索。然而,金融等受监管领域存在独特约束,需要领域优化的框架。我们提出ConFIRM——一种基于LLM的会话式金融信息检索模型,专为查询意图分类和知识库标注而设计。ConFIRM包含两个模块:1)合成金融领域特定问答对的方法;2)针对查询分类任务的参数高效微调方法评估。我们生成了包含4000余个样本的数据集,并在独立测试集上评估准确率。ConFIRM实现了超过90%的准确率,这对满足监管合规性至关重要。该模型为金融对话系统提供了数据高效型的精确查询意图提取解决方案。