Methods and systems for finding elements in optical character recognition documents
Abstract:
Embodiments for finding elements in optical character recognition (OCR) documents are provided. An indication of a selected portion of document is received. Salient pixels in the selected portion of the document are determined. Properties of the salient pixels in the selected portion of the document are identified. The properties of the salient pixels in the selected portion of the document are compared to properties of pixels in each of a plurality of portions of an OCR-converted version of the document. A cognitive analysis is utilized to select at least some of the plurality of portions of the OCR-converted version of the document as suspected matches to the selected portion of the document.
Information query
Patent Agency Ranking
0/0