Invention Grant
US08571270B2 Segmentation of a word bitmap into individual characters or glyphs during an OCR process
有权
在OCR过程中将单词位图分割成单个字符或字形
- Patent Title: Segmentation of a word bitmap into individual characters or glyphs during an OCR process
- Patent Title (中): 在OCR过程中将单词位图分割成单个字符或字形
-
Application No.: US12776576Application Date: 2010-05-10
-
Publication No.: US08571270B2Publication Date: 2013-10-29
- Inventor: Djordje Nijemcevic
- Applicant: Djordje Nijemcevic
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agency: Mayer & Williams, PC
- Main IPC: G06K9/34
- IPC: G06K9/34

Abstract:
An image processing apparatus is provided that includes a character chopper component that segments words into individual characters in a bitmap of a textual image undergoing an OCR process. The Character chopper component is configured to produce a set of (possibly curved) chop-lines which divide a bitmap of any given word into its individual character or glyph candidates. Cases where an input bitmap contains two separate words are handled by marking a place where those words should be split. The character segmentation algorithm computes the set of vertically oriented, curved chop-lines by considering glyph and background colors in a given word bitmap. The set is filtered afterwards using various heuristics, in order to preserve those lines that indeed do separate a word's glyphs and minimize the number of those that do not.
Public/Granted literature
- US20110274354A1 SEGMENTATION OF A WORD BITMAP INTO INDIVIDUAL CHARACTERS OR GLYPHS DURING AN OCR PROCESS Public/Granted day:2011-11-10
Information query