Invention Grant
US09430703B2 Method for segmenting text words in document images using vertical projections of center zones of characters
Legal status: Active
- Patent Title: Method for segmenting text words in document images using vertical projections of center zones of characters
- Application No.: US14578066
- Application Date: 2014-12-19
- Publication No.: US09430703B2
- Publication Date: 2016-08-30
- Inventor: Wei Ming
- Applicant: KONICA MINOLTA LABORATORY U.S.A., INC.
- Applicant Address: US CA San Mateo
- Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
- Current Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
- Current Assignee Address: US CA San Mateo
- Agency: Chen Yoshimura LLP
- Main IPC: G06K9/34
- IPC: G06K9/34 ; G06K9/00 ; G06K9/46 ; G06K9/32

Abstract:
A word segmentation method for segmenting a text line into word segments is described; it is particularly advantageous for processing italic text but can also be used for regular text. A horizontal center zone of the text line, corresponding to the vertical center parts of the characters, is used to generate a center-zone-only vertical projection profile. The center zone is determined from a horizontal projection profile by locating the two major peaks of that profile and taking their positions as the upper and lower boundaries of the center zone. Spacing segments (white gaps) in the vertical projection profile are identified and classified into two classes: character spacing (gaps between characters within a word) and word spacing (gaps between words). The word spacings are used to segment the text line into word segments.
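The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Several details are assumptions made only for illustration: the text line is a small binary grid (`#` = ink), the two major peaks are approximated as the strongest row in the upper half and the strongest row in the lower half, and gaps are split into two classes at the largest jump in sorted gap widths. All function names are hypothetical.

```python
# Toy text line: two words of two "characters" each. An italic-style
# overhang (row 0) and a descender (row 5) intrude into the word gap.
LINE = [
    ".....###......",
    "##.##....##.##",
    "#..#.....#..#.",
    ".#..#.....#..#",
    "##.##....##.##",
    "........#.....",
    "..............",
]
IMG = [[1 if ch == "#" else 0 for ch in row] for row in LINE]


def center_zone(img):
    """Return (top, bottom) rows of the center zone.

    Assumption: the two major peaks of the horizontal projection are
    taken as the strongest row in each half of the line.
    """
    profile = [sum(row) for row in img]  # horizontal projection
    mid = len(profile) // 2
    top = max(range(mid), key=profile.__getitem__)
    bottom = max(range(mid, len(profile)), key=profile.__getitem__)
    return top, bottom


def vertical_profile(img, top, bottom):
    """Column sums restricted to rows top..bottom (inclusive)."""
    return [sum(col) for col in zip(*img[top:bottom + 1])]


def spacing_segments(vprofile):
    """Return (start, width) for each interior run of empty columns."""
    inked = [x for x, v in enumerate(vprofile) if v > 0]
    gaps = []
    for prev, cur in zip(inked, inked[1:]):
        if cur - prev > 1:
            gaps.append((prev + 1, cur - prev - 1))
    return gaps


def word_gap_threshold(widths):
    """Assumed two-class split: midpoint of the largest width jump."""
    ws = sorted(set(widths))
    if len(ws) < 2:
        return None  # only one gap width: treat the line as one word
    _, i = max((ws[k + 1] - ws[k], k) for k in range(len(ws) - 1))
    return (ws[i] + ws[i + 1]) / 2


def segment_words(img):
    """Return (first_col, last_col) for each word segment."""
    top, bottom = center_zone(img)
    vp = vertical_profile(img, top, bottom)
    inked = [x for x, v in enumerate(vp) if v > 0]
    if not inked:
        return []
    gaps = spacing_segments(vp)
    thr = word_gap_threshold([w for _, w in gaps])
    words, start = [], inked[0]
    for gap_start, width in gaps:
        if thr is not None and width > thr:  # word spacing
            words.append((start, gap_start - 1))
            start = gap_start + width
    words.append((start, inked[-1]))
    return words


print(center_zone(IMG))    # (1, 4)
print(segment_words(IMG))  # [(0, 4), (9, 13)]
```

In this toy line, a projection over the full height (`vertical_profile(IMG, 0, len(IMG) - 1)`) has no empty columns between the two words, because the overhang and the descender fill the gap. Restricting the projection to the center zone recovers the four-column word gap, which is the situation the abstract describes for italic text.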