Relation extraction (RE) is a crucial task in natural language processing (NLP) that aims to identify and classify relationships between entities mentioned in text. In the financial domain, relation extraction plays a vital role in extracting valuable information from financial documents, such as news articles, earnings reports, and company filings. This paper describes our solution to relation extraction on one such dataset REFinD. The dataset was released along with shared task as a part of the Fourth Workshop on Knowledge Discovery from Unstructured Data in Financial Services, co-located with SIGIR 2023. In this paper, we employed OpenAI models under the framework of in-context learning (ICL). We utilized two retrieval strategies to find top K relevant in-context learning demonstrations / examples from training data for a given test example. The first retrieval mechanism, we employed, is a learning-free dense retriever and the other system is a learning-based retriever. We were able to achieve 3rd rank overall. Our best F1-score is 0.718.
翻译:关系抽取是自然语言处理中的关键任务,旨在识别和分类文本中提及实体之间的关系。在金融领域,关系抽取对于从新闻文章、财报、公司申报等金融文档中提取有价值信息具有重要作用。本文描述了我们在REFinD数据集上的关系抽取解决方案。该数据集作为第四届金融服务非结构化数据知识发现研讨会(与SIGIR 2023同期举办)的共享任务而发布。本文采用上下文学习框架结合OpenAI模型,通过两种检索策略从训练数据中为给定测试样例选取前K个相关的上下文学习示例。第一种检索机制采用无监督稠密检索器,第二种系统则基于学习型检索器。我们最终获得总排名第三的成绩,最佳F1分数达到0.718。