Invention Grant
US08260049B2 Model-based method of document logical structure recognition in OCR systems
Legal status: In force
- Patent Title: Model-based method of document logical structure recognition in OCR systems
- Application No.: US12236054
- Application Date: 2008-09-23
- Publication No.: US08260049B2
- Publication Date: 2012-09-04
- Inventor: Dmitry Deryagin, Konstantin Anisimovich
- Applicant: Dmitry Deryagin, Konstantin Anisimovich
- Applicant Address: CY Nicosia
- Assignee: ABBYY Software Ltd.
- Current Assignee: ABBYY Software Ltd.
- Current Assignee Address: CY Nicosia
- Agent: John Chandler Meline; LeighAnn Welland
- Main IPC: G06K9/34
- IPC: G06K9/34

Abstract:
In one embodiment, the invention provides a method for determining a logical structure of a document. The method comprises generating at least one document hypothesis for the whole document; for each document hypothesis, verifying said document hypothesis including (a) generating at least one block hypothesis for each block in the document based on the document hypothesis; and (b) selecting a best block hypothesis for each block; selecting as a best document hypothesis the document hypothesis that has the best degree of correspondence with the selected best block hypotheses for the document; and forming the document based on the best document hypothesis.
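The selection procedure described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the names `BlockHypothesis`, `recognize_structure`, and `toy_generate` are hypothetical, blocks are represented as plain strings, and the "degree of correspondence" is assumed here to be the sum of the best block scores, which the abstract does not specify.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class BlockHypothesis:
    """One candidate interpretation of a block (hypothetical structure)."""
    label: str    # e.g. "greeting", "heading", "paragraph"
    score: float  # assumed quality estimate; higher is better


# Hypothetical generator signature: (document hypothesis, block) -> candidates.
BlockGenerator = Callable[[str, str], List[BlockHypothesis]]


def recognize_structure(
    blocks: Sequence[str],
    document_hypotheses: Sequence[str],
    generate: BlockGenerator,
) -> Tuple[Optional[str], List[BlockHypothesis]]:
    """Return the best document hypothesis and its chosen block hypotheses."""
    best_doc: Optional[str] = None
    best_blocks: List[BlockHypothesis] = []
    best_total = float("-inf")

    for doc in document_hypotheses:
        # Verify the document hypothesis: for each block, generate block
        # hypotheses under it and keep the best one. The generator is
        # expected to return at least one candidate per block.
        chosen = [max(generate(doc, b), key=lambda h: h.score) for b in blocks]

        # Assumption: correspondence is measured as the sum of block scores.
        total = sum(h.score for h in chosen)
        if total > best_total:
            best_doc, best_blocks, best_total = doc, chosen, total

    return best_doc, best_blocks


def toy_generate(doc: str, block: str) -> List[BlockHypothesis]:
    """Toy generator with made-up scores, for illustration only."""
    if doc == "letter":
        return [
            BlockHypothesis("greeting", 0.9 if block.startswith("Dear") else 0.2),
            BlockHypothesis("body", 0.5),
        ]
    return [
        BlockHypothesis("heading", 0.8 if block.isupper() else 0.1),
        BlockHypothesis("paragraph", 0.4),
    ]


if __name__ == "__main__":
    doc, chosen = recognize_structure(
        ["Dear Ms. Smith", "Thank you for your letter."],
        ["letter", "article"],
        toy_generate,
    )
    print(doc, [h.label for h in chosen])
```

The final step in the abstract, forming the document from the best hypothesis, is left out; in this sketch it would consume the returned hypothesis and block labels.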
Public/Granted literature
- US20090087094A1 MODEL-BASED METHOD OF DOCUMENT LOGICAL STRUCTURE RECOGNITION IN OCR SYSTEMS Public/Granted date: 2009-04-02