Invention Grant
- Patent Title: Method and system for tabular information extraction
-
Application No.: US18135369Application Date: 2023-04-17
-
Publication No.: US12087072B2Publication Date: 2024-09-10
- Inventor: Sidharth Talwar , Sanjay Saran Garg , Ranjit Radhakrishnan , Sunil Nair , Devang Jayachandran
- Applicant: JPMorgan Chase Bank, N.A.
- Applicant Address: US NY New York
- Assignee: JPMORGAN CHASE BANK, N.A.
- Current Assignee: JPMORGAN CHASE BANK, N.A.
- Current Assignee Address: US NY New York
- Agency: Greenblum & Bernstein, P.L.C.
- Main IPC: G06V30/414
- IPC: G06V30/414 ; G06V30/148 ; G06V30/412

Abstract:
A method and a system for extracting information from a table in a document is provided. The method includes: receiving a document that includes information that is arranged in a table; determining three sets of coordinates that respectively relate to lines, words, and characters included in the document; extracting a list of lines based on the first set of coordinates; reconstructing the rows of the table based on list of lines and the second set of coordinates; reconstructing the columns of the table based on the reconstructed rows and the third set of coordinates; and outputting a reconstruction of the table. The three sets of coordinates are expressible in an hOCR format that is based on an open standard for representation of scanned information that is obtainable by using an optical character recognition (OCR) technique.
Public/Granted literature
- US20230260311A1 METHOD AND SYSTEM FOR TABULAR INFORMATION EXTRACTION Public/Granted day:2023-08-17
Information query