Invention Grant
- Patent Title: Page classifier engine
- Patent Title (中): 页面分类引擎
- Application No.: US11949586
- Application Date: 2007-12-03
- Publication No.: US08392816B2
- Publication Date: 2013-03-05
- Inventor: Bogdan Radakovic, Aleksandar Uzelac, Bodin Dresevic, Oren Trutner
- Applicant: Bogdan Radakovic, Aleksandar Uzelac, Bodin Dresevic, Oren Trutner
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agency: Shook Hardy & Bacon L.L.P.
- Main IPC: G06F17/21
- IPC: G06F17/21

Abstract:
Embodiments of the present invention relate to classifying pages of an electronic document, such as a scanned book page. OCR software is applied to the contents of the electronic document, revealing semantic information about that content. Software-based features are then applied to the semantic information to determine the page type. Page types may include table of contents (TOC), table of figures (TOF), bibliography, index, or other types of pages commonly found in a book, magazine, or other publication. Once determined, the page type is stored and used by other software engines.
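
The flow described in the abstract (OCR output, then features, then a stored page type) can be illustrated with a minimal sketch. This is not the patented method: the feature names, keyword lists, the 0.5 threshold, and the functions `extract_features` and `classify_page` are all assumptions made up for illustration, and the input is assumed to be the text lines that an OCR step has already produced for one page.

```python
import re
from dataclasses import dataclass


@dataclass
class PageFeatures:
    """Hypothetical features computed from the OCR text lines of one page."""
    heading: str                  # first non-empty line, lowercased
    trailing_number_ratio: float  # share of body lines ending in a digit
    year_ratio: float             # share of body lines containing a 4-digit year


def extract_features(lines):
    """Turn OCR text lines into a small feature record (illustrative only)."""
    lines = [ln.strip() for ln in lines if ln.strip()]
    if not lines:
        return PageFeatures("", 0.0, 0.0)
    heading = lines[0].lower()
    body = lines[1:] or lines  # fall back to the heading if there is no body
    trailing = sum(bool(re.search(r"\d$", ln)) for ln in body) / len(body)
    years = sum(bool(re.search(r"\b(1[5-9]|20)\d{2}\b", ln)) for ln in body) / len(body)
    return PageFeatures(heading, trailing, years)


def classify_page(lines):
    """Map features to a page type with simple, assumed rules."""
    f = extract_features(lines)
    mostly_numbered = f.trailing_number_ratio >= 0.5  # assumed threshold
    if "contents" in f.heading and mostly_numbered:
        return "toc"
    if ("figures" in f.heading or "illustrations" in f.heading) and mostly_numbered:
        return "tof"
    if ("bibliography" in f.heading or "references" in f.heading) and f.year_ratio >= 0.5:
        return "bibliography"
    if f.heading == "index" and mostly_numbered:
        return "index"
    return "other"


# Store the determined type per page so that later stages can look it up.
pages = {
    1: ["Contents", "1. Introduction .... 1", "2. Methods .... 15"],
    2: ["Chapter 1", "It was a bright cold day in April."],
}
page_types = {number: classify_page(lines) for number, lines in pages.items()}
```

Keeping the result in a plain mapping such as `page_types` mirrors the abstract's last step, in which the stored page type is consumed by other software engines. A real system would likely use richer layout information from the OCR step than bare text lines.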
Public/Granted literature:
- US20090144605A1 PAGE CLASSIFIER ENGINE, Public/Granted day: 2009-06-04