Evaluating Large Language Models for Stance Detection on Financial Targets from SEC Filing Reports and Earnings Call Transcripts

Financial narratives in U.S. Securities and Exchange Commission (SEC) filing reports and quarterly earnings call transcripts (ECTs) are essential sources for investors, auditors, and regulators. However, their length, financial jargon, and nuanced language make fine-grained analysis difficult. Prior sentiment analysis in the financial domain has relied on large, expensive labeled datasets, which makes detecting sentence-level stance toward specific financial targets challenging. In this work, we introduce a sentence-level corpus for stance detection focused on three core financial metrics: debt, earnings per share (EPS), and sales. The sentences were extracted from Form 10-K annual reports and ECTs, and labeled for stance (positive, negative, neutral) using the ChatGPT-o3-pro model, with rigorous human validation of the labels. Using this corpus, we conduct a systematic evaluation of modern large language models (LLMs) under zero-shot, few-shot, and Chain-of-Thought (CoT) prompting strategies. Our results show that few-shot prompting combined with CoT performs best, including relative to supervised baselines, and that LLM performance varies between the SEC and ECT datasets. Our findings highlight the practical viability of using LLMs for target-specific stance detection in the financial domain without requiring extensive labeled data.
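As an illustration of the three prompting strategies named above, the following is a minimal sketch of how zero-shot, few-shot, and CoT prompts for target-specific stance could be assembled. The abstract does not give the actual prompt wording, so the template text, the helper names `build_prompt` and `parse_label`, and the example sentences are all hypothetical; only the label set and the three targets come from the abstract.

```python
from typing import Optional, Sequence, Tuple

# Label set and targets as listed in the abstract.
LABELS = ("positive", "negative", "neutral")
TARGETS = ("debt", "EPS", "sales")

# A demonstration for few-shot prompting: (sentence, target, label).
Example = Tuple[str, str, str]


def build_prompt(
    sentence: str,
    target: str,
    examples: Sequence[Example] = (),
    use_cot: bool = False,
) -> str:
    """Build a stance prompt (hypothetical wording, not the paper's).

    No examples -> zero-shot; with examples -> few-shot;
    use_cot=True asks the model to reason before answering.
    """
    if target not in TARGETS:
        raise ValueError(f"unknown target: {target!r}")
    lines = [
        "Classify the stance of the sentence toward the financial target "
        f"as one of: {', '.join(LABELS)}."
    ]
    if use_cot:
        lines.append(
            "Reason step by step, then give the final label "
            "on a line starting with 'Stance:'."
        )
    else:
        lines.append("Answer with the label only.")
    for ex_sentence, ex_target, ex_label in examples:
        if ex_label not in LABELS:
            raise ValueError(f"unknown label: {ex_label!r}")
        lines += [
            "",
            f"Sentence: {ex_sentence}",
            f"Target: {ex_target}",
            f"Stance: {ex_label}",
        ]
    # The query comes last and leaves the label open for the model.
    lines += ["", f"Sentence: {sentence}", f"Target: {target}", "Stance:"]
    return "\n".join(lines)


def parse_label(response: str) -> Optional[str]:
    """Return the label on the last line that is exactly a label
    (optionally prefixed by 'Stance:'), or None if there is none."""
    for line in reversed(response.strip().splitlines()):
        text = line.strip().lower()
        if text.startswith("stance:"):
            text = text[len("stance:"):].strip()
        text = text.strip(" .")
        if text in LABELS:
            return text
    return None


if __name__ == "__main__":
    # Hypothetical sentences, invented for illustration only.
    demo = [("Net sales grew 8% over the prior year.", "sales", "positive")]
    query = "Total debt increased by 12% year over year."
    print(build_prompt(query, "debt", examples=demo, use_cot=True))
```

The same function covers all three settings, so a comparison across strategies changes only the `examples` and `use_cot` arguments. In a fuller CoT few-shot setup, each demonstration would likely also carry a written rationale; that is omitted here to keep the sketch short.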

