Invention Grant
- Patent Title: Ingestion planning for complex tables
- Application No.: US16660877
- Application Date: 2019-10-23
- Publication No.: US11244011B2
- Publication Date: 2022-02-08
- Inventor: Paul R. Bastide, Matthew E. Broomhall, Donna K. Byron, Robert E. Loredo
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent: Jordan T. Schiller
- Main IPC: G06F16/00
- IPC: G06F16/00; G06F40/00; G06F16/93; G06F16/48; G06F16/14; G06F16/242; G06F16/31; G06F16/34; G06F16/22; G06F16/332; G06F16/33; G06F16/2457; G06F16/25; G06F40/56; G06F40/151; G06F40/177; G06F16/41; G06F40/20

Abstract:
Embodiments of the present invention disclose a method, computer program product, and system for generating a plan for document processing. A plurality of electronic documents are received, by a computer, using a network. The plurality of electronic documents are analyzed, using the computer, to identify a plurality of tabular data, based on the analyzed plurality of electronic documents. Textual data is identified within the identified tabular data, of the analyzed plurality of electronic documents. Textual hints are generated, based on the identified textual data within the identified tabular data. References are identified, wherein references are based on matching textual hints with textual data in the received plurality of electronic documents. A count of references is calculated, associated with one or more sets of tabular data. A priority score is calculated based on the count of references, and an ingestion plan is generated, based on the calculated priority score.
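The steps named in the abstract (textual hints, reference counts, priority score, ingestion plan) can be illustrated with a minimal Python sketch. It is not the patented implementation. The table representation (a dict with `caption` and `headers`), the function names, the case-insensitive substring matching, and the use of a normalized reference count as the priority score are all assumptions made for illustration only.

```python
import re


def generate_hints(table):
    """Build textual hints from a table's caption and header cells (assumed fields)."""
    candidates = [table.get("caption", "")] + list(table.get("headers", []))
    return {text.strip().lower() for text in candidates if text.strip()}


def count_references(hints, documents):
    """Count case-insensitive occurrences of every hint across all document texts.

    Plain substring matching is an assumption; for example, the hint
    "table 1" would also match inside "table 10".
    """
    total = 0
    for text in documents:
        lowered = text.lower()
        for hint in hints:
            total += len(re.findall(re.escape(hint), lowered))
    return total


def build_ingestion_plan(tables, documents):
    """Return (plan, counts): table names ordered by descending priority score.

    The priority score here is the table's share of all counted references,
    which is a hypothetical scoring rule chosen for this sketch.
    """
    counts = {
        name: count_references(generate_hints(table), documents)
        for name, table in tables.items()
    }
    total = sum(counts.values()) or 1  # avoid division by zero
    scores = {name: count / total for name, count in counts.items()}
    # Sort by score (highest first), breaking ties by name for stability.
    plan = sorted(scores, key=lambda name: (-scores[name], name))
    return plan, counts


if __name__ == "__main__":
    # Hypothetical sample data.
    tables = {
        "table_1": {"caption": "Table 1", "headers": ["revenue"]},
        "table_2": {"caption": "Table 2", "headers": ["headcount"]},
    }
    documents = [
        "As shown in Table 1, revenue grew. See Table 1 again.",
        "Table 2 lists headcount.",
    ]
    plan, counts = build_ingestion_plan(tables, documents)
    print(plan, counts)
```

In this sketch, tables that the surrounding text refers to more often are placed earlier in the plan, which mirrors the abstract's idea of prioritizing ingestion by reference count.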
Public/Granted literature
- US20200050643A1 INGESTION PLANNING FOR COMPLEX TABLES Public/Granted day: 2020-02-13