Invention Grant
- Patent Title: End-to-end system for extracting tabular data present in electronic documents and method thereof
-
Application No.: US17683954Application Date: 2022-03-01
-
Publication No.: US11887393B2Publication Date: 2024-01-30
- Inventor: Nandhinee Periyakaruppan , Harinath Krishnamoorthy , Anil Goyal , Sudarsun Santhiappan
- Applicant: CLARITRICS INC.
- Applicant Address: US NY New York
- Assignee: CLARITRICS INC.
- Current Assignee: CLARITRICS INC.
- Current Assignee Address: US NY New York
- Agency: Foley & Lardner LLP
- Main IPC: G06V30/412
- IPC: G06V30/412 ; G06V30/414 ; G06V30/18 ; G06V30/19 ; G06V30/184

Abstract:
The present disclosure describes a method, system, and a computer readable medium for extracting tabular data present in a document. The method comprises detecting presence of at least one table in the document using a deep learning based model and a statistical method. The method further comprises identifying a type of the table based on determining a count of horizontal and vertical lines, presence of outer borders, and presence of row-column intersections in the table. The type of the table comprises a bordered table, a partially bordered table, or a borderless table. The method further comprises processing the detected table, depending on its type, to identify one or more cells present in the table. The method further comprises generating an output file by extracting the tabular data present in the table, where the extracting comprises performing optical character recognition on the identified one or more cells.
Public/Granted literature
- US20220284722A1 END-TO-END SYSTEM FOR EXTRACTING TABULAR DATA PRESENT IN ELECTRONIC DOCUMENTS AND METHOD THEREOF Public/Granted day:2022-09-08
Information query