Invention Grant
US08218875B2 Method and system for preprocessing an image for optical character recognition
Status: Expired
- Patent Title: Method and system for preprocessing an image for optical character recognition
- Application No.: US12814448
- Application Date: 2010-06-12
- Publication No.: US08218875B2
- Publication Date: 2012-07-10
- Inventor: Hussein Khalid Al-Omari, Mohammad Sulaiman Khorsheed
- Applicant: Hussein Khalid Al-Omari, Mohammad Sulaiman Khorsheed
- Main IPC: G06K9/34
- IPC: G06K9/34 ; G06K9/18 ; G06K9/00 ; G06K9/36 ; H04N1/04

Abstract:
A method and system for preprocessing an image for Optical Character Recognition (OCR) are disclosed, wherein the image includes a plurality of columns. Each column includes one or more of Arabic text and non-text items. The method includes determining a plurality of components associated with one or more of the Arabic text and the non-text items, wherein a component includes a set of connected pixels. Once the plurality of components has been determined, a line height and a column spacing are determined for the plurality of components. The plurality of components are then associated with a column of the plurality of columns based on the line height and the column spacing. Subsequently, a set of characteristic parameters is calculated for each column, and the plurality of components of each column are merged based on the set of characteristic parameters to form sub-words and words.
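The first steps of the abstract (finding connected components, estimating a line height, and assigning components to columns) can be illustrated with a minimal sketch. This is not the patented procedure: the abstract does not say how line height or column spacing is computed, so the sketch assumes a binary image given as a list of rows of 0/1 values, 8-connectivity, the median component height as the line-height estimate, and a caller-supplied `column_spacing` threshold. The function names are hypothetical.

```python
from collections import deque
from statistics import median


def connected_components(image):
    """Return bounding boxes (top, left, bottom, right) of 8-connected
    foreground regions in a binary image (list of rows of 0/1)."""
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if image[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and image[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def estimate_line_height(boxes):
    """Assumed heuristic: median bounding-box height in pixels."""
    return median(b[2] - b[0] + 1 for b in boxes)


def group_into_columns(boxes, column_spacing):
    """Assumed heuristic: sweep boxes by left edge and start a new column
    when the empty horizontal gap exceeds column_spacing pixels."""
    columns = []
    current_right = None
    for box in sorted(boxes, key=lambda b: b[1]):
        if current_right is None or box[1] - current_right - 1 > column_spacing:
            columns.append([box])
            current_right = box[3]
        else:
            columns[-1].append(box)
            current_right = max(current_right, box[3])
    return columns
```

A real page would need a threshold derived from the image itself, and the later merging of components into sub-words and words depends on characteristic parameters that the abstract does not enumerate, so that step is left out here.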
Public/Granted literature
- US20110305387A1 METHOD AND SYSTEM FOR PREPROCESSING AN IMAGE FOR OPTICAL CHARACTER RECOGNITION Public/Granted day: 2011-12-15