Ingestion planning for complex tables
Abstract:
Embodiments of the present invention disclose a method, computer program product, and system for generating a plan for document processing. A computer receives a plurality of electronic documents over a network and analyzes them to identify a plurality of sets of tabular data. Textual data is identified within the identified tabular data, and textual hints are generated based on that textual data. References are identified by matching the textual hints with textual data in the received plurality of electronic documents. A count of references associated with one or more sets of tabular data is calculated. A priority score is calculated based on the count of references, and an ingestion plan is generated based on the calculated priority score.
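The steps described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the `Table` class, the function names, the rule that a hint is any non-numeric cell, whole-word matching, and the use of the raw reference count as the priority score are all assumptions made for illustration, since the abstract does not specify them.

```python
import re
from dataclasses import dataclass


@dataclass
class Table:
    """Hypothetical container for one set of tabular data."""
    table_id: str
    cells: list  # flat list of cell strings


def textual_hints(table):
    """Return lowercase non-numeric cell strings as hints (assumed heuristic)."""
    hints = set()
    for cell in table.cells:
        text = cell.strip().lower()
        # Skip empty cells and cells made up only of digits and punctuation.
        if text and not re.fullmatch(r"[\d.,%\-\s]+", text):
            hints.add(text)
    return hints


def count_references(hints, document_texts):
    """Count whole-word, case-insensitive occurrences of any hint in the texts."""
    total = 0
    for body in document_texts:
        lowered = body.lower()
        for hint in hints:
            pattern = r"\b" + re.escape(hint) + r"\b"
            total += len(re.findall(pattern, lowered))
    return total


def build_ingestion_plan(tables, document_texts):
    """Order tables by reference count (assumed to serve as the priority score)."""
    scored = [
        (count_references(textual_hints(t), document_texts), t.table_id)
        for t in tables
    ]
    # Highest score first; ties are broken by table id for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(table_id, score) for score, table_id in scored]


if __name__ == "__main__":
    tables = [
        Table("t1", ["Region", "Revenue", "100"]),
        Table("t2", ["Widget", "42"]),
    ]
    texts = [
        "Revenue by region rose. Revenue was strong.",
        "The widget shipped.",
    ]
    print(build_ingestion_plan(tables, texts))
```

In this sketch, a table whose textual cells are mentioned more often in the surrounding document text is placed earlier in the plan. A different scoring function could be substituted without changing the overall flow.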