Invention Grant
- Patent Title: Systems and methods for data extraction from electronic documents using data patterns
-
Application No.: US17064150Application Date: 2020-10-06
-
Publication No.: US11625419B2Publication Date: 2023-04-11
- Inventor: Punitha Chandrasekar , Sourav Karmakar , Amol Vinayak Jadhav , Bidhan Roy , Victor S. Y. Lo , Varun Vivek Aher , Ankit Garg
- Applicant: FMR LLC
- Applicant Address: US MA Boston
- Assignee: FMR LLC
- Current Assignee: FMR LLC
- Current Assignee Address: US MA Boston
- Agency: Cesari & McKenna, LLP
- Main IPC: G06F16/28
- IPC: G06F16/28 ; G06F16/93 ; G06F40/186 ; G06F16/22 ; G06V30/416

Abstract:
Systems and methods for extracting data from electronic documents based on data patterns. The method includes receiving electronic template documents. Each template document corresponds to a type of electronic document. The method further includes, for each template document, processing the template document using a text extraction and data processing application. The method also includes, for each template document, determining a data extraction formula corresponding to the type of electronic document. The method further includes, storing the data extraction formula in a first database. The method also includes, receiving an electronic document including user data and a Unicode corresponding to the type of document. The method also includes, processing and classifying the electronic document into the type of document corresponding to the Unicode. The method also includes identifying data elements in the electronic document based on the data extraction formula and extracting data values for each of the identified data elements.
Public/Granted literature
- US20220107964A1 Systems and Methods for Data Extraction from Electronic Documents Using Data Patterns Public/Granted day:2022-04-07
Information query