Process of information extraction (IE) is often used to extract meaningful information from unstructured and unlabeled data. Conventional methods of data extraction including application of OCR and passing extraction engine, are inefficient on large data and have their limitation. In this paper, a peculiar technique of information extraction is proposed using A2I and computer vision technologies, which also includes NLP.
翻译:信息提取(IE)过程常用于从非结构化及无标注数据中提取有意义的信息。传统的数据提取方法,包括应用OCR及传递提取引擎,在处理大规模数据时效率低下且存在局限性。本文提出一种结合A2I、计算机视觉技术及自然语言处理(NLP)的独特信息提取方法。