Extracting key information from documents accounts for a large portion of business workloads and therefore offers high potential for efficiency improvements and process automation. With recent advances in Deep Learning, a plethora of Deep Learning-based approaches for Key Information Extraction have been proposed under the umbrella term Document Understanding, enabling the processing of complex business documents. The goal of this systematic literature review is an in-depth analysis of existing approaches in this domain and the identification of opportunities for further research. To this end, 130 approaches published between 2017 and 2024 are analyzed in this study.