Invention Grant
US08150160B2 Automatic Arabic text image optical character recognition method 有权
自动阿拉伯文字图像光学字符识别方法

Automatic Arabic text image optical character recognition method
Abstract:
The automatic Arabic text image optical character recognition method includes training a text recognition system using Arabic printed text, using the produced models for classification of newly unseen Arabic scanned text, and generating the corresponding textual information. Scanned images of Arabic text and copies of minimal Arabic text are used in the training sessions. Each page is segmented into lines. Features of each line are extracted and input to Hidden Markov Model (HMM). All training data training features are used. HMM runs training algorithms to produce codebook and language models. In the classification stage new Arabic text is input in scanned form. Line segmentation where lines are extracted is passed through. In the feature stage, line features are extracted and input to the classification stage. In the classification stage the corresponding Arabic text is generated.
Public/Granted literature
Information query
Patent Agency Ranking
0/0