Invention Grant
US08594422B2 Page layout determination of an image undergoing optical character recognition
有权
正在进行光学字符识别的图像的页面布局确定
- Patent Title: Page layout determination of an image undergoing optical character recognition
- Patent Title (中): 正在进行光学字符识别的图像的页面布局确定
-
Application No.: US12721949Application Date: 2010-03-11
-
Publication No.: US08594422B2Publication Date: 2013-11-26
- Inventor: Mircea Cimpoi , Sasa Galic , Milan Vugdelija
- Applicant: Mircea Cimpoi , Sasa Galic , Milan Vugdelija
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agency: Collins & Collins Intellectual, LLC
- Agent L. Alan Collins
- Main IPC: G06K9/00
- IPC: G06K9/00

Abstract:
A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.
Public/Granted literature
- US20110222771A1 PAGE LAYOUT DETERMINATION OF AN IMAGE UNDERGOING OPTICAL CHARACTER RECOGNITION Public/Granted day:2011-09-15
Information query