FinCARDS: Card-Based Analyst Reranking for Financial Document Question Answering

Financial question answering (QA) over long corporate filings requires evidence to satisfy strict constraints on entities, financial metrics, fiscal periods, and numeric values. However, existing LLM-based rerankers primarily optimize semantic relevance, leading to unstable rankings and opaque decisions on long documents. We propose FinCards, a structured reranking framework that reframes financial evidence selection as constraint satisfaction under a finance-aware schema. FinCards represents filing chunks and questions using aligned schema fields (entities, metrics, periods, and numeric spans), enabling deterministic field-level matching. Evidence is selected via a multi-stage tournament reranking with stability-aware aggregation, producing auditable decision traces. Across two corporate filing QA benchmarks, FinCards substantially improves early-rank retrieval over both lexical and LLM-based reranking baselines, while reducing ranking variance, without requiring model fine-tuning or unpredictable inference budgets. Our code is available at https://github.com/XanderZhou2022/FINCARDS.

翻译：金融问答任务需从长篇企业财报中提取证据，以满足对实体、财务指标、会计期间及数值的严格约束。然而，现有基于大语言模型的重排序方法主要优化语义相关性，导致长文档排序结果不稳定且决策过程不透明。本文提出FinCards结构化重排序框架，将金融证据选择问题重新定义为在金融感知模式下的约束满足问题。该框架利用对齐后的模式字段（实体、指标、会计期间和数值跨度）表示文档块与问题，实现确定性字段级匹配。通过多阶段锦标赛式重排序结合稳定性感知聚合策略进行证据选择，生成可审计的决策轨迹。在两个企业财报问答基准测试中，FinCards在不需模型微调或不可预测推理预算的情况下，相较于基于词法和LLM的重排序基线，显著提升了早期排序召回率，同时降低了排序方差。本论文代码已开源：https://github.com/XanderZhou2022/FINCARDS

相关内容

排序

关注 313

排序是计算机内经常进行的一种操作，其目的是将一组“无序”的记录序列调整为“有序”的记录序列。分内部排序和外部排序。若整个排序过程不需要访问外存便能完成，则称此类排序问题为内部排序。反之，若参加排序的记录数量很大，整个序列的排序过程不可能在内存中完成，则称此类排序问题为外部排序。内部排序的过程是一个逐步扩大记录的有序序列长度的过程。

【AAAI2026】FinRpt：面向证券研究报告生成的数据集、评测体系与基于大语言模型的多智能体框架

专知会员服务

20+阅读 · 2025年11月11日

SORA底层模型用好了也能赚钱！DiffsFormer：基于扩散模型的股票因子生成

专知会员服务

37+阅读 · 2024年2月29日

【牛津大学博士论文】基于数据驱动的金融时间序列模拟和预测方法，238页pdf

专知会员服务

62+阅读 · 2023年9月4日

【RecSys22教程】多阶段推荐系统的神经重排序，90页ppt

专知会员服务

27+阅读 · 2022年9月30日