Invention Grant
- Patent Title: Structural clustering and alignment of OCR results
-
Application No.: US16234148Application Date: 2018-12-27
-
Publication No.: US10824899B2Publication Date: 2020-11-03
- Inventor: Yan Wang , Arun Sacheti , Vishal Chhabilbhai Thakkar , Surendra Srinivas Ulabala , Shloak Jain
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Medley, Behrens & Lewis, LLC
- Main IPC: G06K9/46
- IPC: G06K9/46 ; G06T11/20

Abstract:
Representative embodiments disclose mechanisms to create a text stream from raw OCR outputs. The raw OCR output comprises a plurality of bounding boxes, each bounding box defining a region containing text which has been recognized by the OCR system. A weight matrix is calculated that comprises a weight for each pair of bounding boxes. The weight representing the probability that a pair of bounding boxes belongs to the same cluster. The bounding boxes are then clustered along the weights. The resulting clusters are first ordered using an ordering criteria. The bounding boxes within each cluster are then ordered according to a second ordering criteria. The ordered clusters and bounding boxes are then arranged into a text stream.
Public/Granted literature
- US20200210743A1 STRUCTURAL CLUSTERING AND ALIGNMENT OF OCR RESULTS Public/Granted day:2020-07-02
Information query