Invention Grant
- Patent Title: System and method for extracting tabular data from electronic document
-
Application No.: US16366428Application Date: 2019-03-27
-
Publication No.: US10970535B2Publication Date: 2021-04-06
- Inventor: Shubhojit Mallick , Kedar Bartake , Omkar Kumbhar
- Applicant: Innoplexus AG
- Applicant Address: DE Eschborn
- Assignee: Innoplexus AG
- Current Assignee: Innoplexus AG
- Current Assignee Address: DE Eschborn
- Agency: Ziegler IP Law Group, LLC
- Priority: GB1809546 20180611
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06K9/34 ; G06K9/40

Abstract:
Disclosed is system for extracting tabular data from electronic document, system having data processing arrangement comprising: tabular data detection module that is operable to: (i) receive electronic document; (ii) determine location of tabular data within electronic document; and (iii) extract image of tabular data from electronic document; and tabular data extraction module that receives extracted image of tabular data from tabular data detection module, wherein tabular data extraction module is operable to: (i) convert received image of tabular data into greyscale image; (ii) extract grid structure from greyscale image; (iii) remove grid structure from greyscale image; (iv) determine position for placement of horizontal and vertical lines in greyscale image; (v) generate horizontal and vertical lines on greyscale image; (vi) perform optical character recognition of text associated with tabular data from received image; and (vii) extract tabular data by combining information of grid structure with text, to generate tabular data.
Public/Granted literature
- US20200089946A1 SYSTEM AND METHOD FOR EXTRACTING TABULAR DATA FROM ELECTRONIC DOCUMENT Public/Granted day:2020-03-19
Information query