Invention Grant
- Patent Title: Page layout determination of an image undergoing optical character recognition
-
Application No.: US14079395Application Date: 2013-11-13
-
Publication No.: US09785849B2Publication Date: 2017-10-10
- Inventor: Mircea Cimpoi , Sasa Galic , Milan Vugdelija
- Applicant: Microsoft Corporation
- Applicant Address: US WA Redmond
- Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
- Current Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
- Current Assignee Address: US WA Redmond
- Agency: Collins & Collins Intellectual, LLC
- Agent L. Alan Collins
- Main IPC: G06K9/18
- IPC: G06K9/18 ; G06K9/00 ; G06K9/32

Abstract:
A method and system is provided for identifying a page layout of an image that includes textual regions. The textual regions are to undergo optical character recognition (OCR). The system includes an input component that receives an input image that includes words around which bounding boxes have been formed and a text identifying component that groups the words into a plurality of text regions. A reading line component groups words within each of the text regions into reading lines. A text region sorting component that sorts the text regions in accordance with their reading order.
Public/Granted literature
- US20140072224A1 PAGE LAYOUT DETERMINATION OF AN IMAGE UNDERGOING OPTICAL CHARACTER RECOGNITION Public/Granted day:2014-03-13
Information query