Invention Grant
- Patent Title: Word recognition of text undergoing an OCR process
- Patent Title (中): 对正在进行OCR流程的文本的Word识别
-
Application No.: US12772376Application Date: 2010-05-03
-
Publication No.: US08401293B2Publication Date: 2013-03-19
- Inventor: Aleksandar Antonijevic , Ivan Mitic , Mircea Cimpoi , Djordje Nijemcevic
- Applicant: Aleksandar Antonijevic , Ivan Mitic , Mircea Cimpoi , Djordje Nijemcevic
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agency: Mayer & Williams PC
- Main IPC: G06K9/00
- IPC: G06K9/00

Abstract:
A method for identifying words in a textual image undergoing optical character recognition includes receiving a bitmap of an input image which includes textual lines that have been segmented by a plurality of chop lines. The chop lines are each associated with a confidence level reflecting a degree to which the respective chop line properly segments the textual line into individual characters. One or more words are identified in one of the textual lines based at least in part on the textual lines and a first subset of the plurality of chop lines which have a chop line confidence level above a first threshold value. If the first word is not associated with a sufficiently high word confidence level, at least a second word in the textual line is identified based at least in part on a second subset of the plurality of chop lines which have a confidence level above a second threshold value lower than the first threshold value.
Public/Granted literature
- US20110268360A1 WORD RECOGNITION OF TEXT UNDERGOING AN OCR PROCESS Public/Granted day:2011-11-03
Information query