- Patent Title: Method and system for determining structural blocks of a document
- Application No.: US16134657
- Application Date: 2018-09-18
- Publication No.: US10691937B2
- Publication Date: 2020-06-23
- Inventor: Raghavendra Hosabettu, Sneha Subhaschandra Banakar
- Applicant: Wipro Limited
- Applicant Address: IN Bangalore
- Assignee: Wipro Limited
- Current Assignee: Wipro Limited
- Current Assignee Address: IN Bangalore
- Agency: Finnegan, Henderson, Farabow, Garrett & Dunner, LLP
- Main IPC: G06K9/34
- IPC: G06K9/34 ; G06K9/00 ; G06K9/72

Abstract:
This disclosure relates to a method and system for determining structural blocks of a document. The method may include extracting text lines from the document, generating a feature vector for each text line by determining feature values for a set of features in each text line, and determining, for each structural class, at least one dominant feature from among the set of features and at least one corresponding dominance factor, based on the feature vector for each text line. The method may further include deriving a set of rules for classifying the text lines into their respective structural classes and determining a structural block tag for each text line based on the set of rules. Each rule in the set corresponds to one of the structural classes and is based on the at least one dominant feature and the at least one corresponding dominance factor for that class.
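
The steps named in the abstract (line extraction, per-line feature vectors, per-class dominant features with dominance factors, rule derivation, and tagging) can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not list the features, define how a dominance factor is computed, or say where class labels come from, so everything below is an assumption made for illustration: the four binary features, the labeled `TRAINING` lines, the dominance factor (in-class rate of a feature minus its rate in all other classes), and the function names.

```python
from collections import defaultdict

# Hypothetical binary features; the abstract does not enumerate the feature set.
FEATURES = {
    "is_upper": lambda s: s.isupper(),
    "is_short": lambda s: len(s.split()) <= 4,
    "starts_with_bullet": lambda s: s.lstrip().startswith(("-", "*")),
    "ends_with_period": lambda s: s.rstrip().endswith("."),
}

# Hypothetical labeled lines used to estimate dominance per structural class.
TRAINING = [
    ("INTRODUCTION", "heading"),
    ("RELATED WORK", "heading"),
    ("- first item", "list"),
    ("- second item", "list"),
    ("This is a full sentence of body text.", "paragraph"),
    ("Another sentence that runs a bit longer here.", "paragraph"),
]


def extract_lines(document):
    """Split a document into its non-empty text lines."""
    return [line for line in document.splitlines() if line.strip()]


def feature_vector(line):
    """Map each feature name to 0 or 1 for one text line."""
    return {name: int(test(line)) for name, test in FEATURES.items()}


def dominant_features(labeled_lines):
    """Return {class: (dominant feature, dominance factor)}.

    Assumed definition: the dominance factor of a feature for a class is
    its in-class rate minus its rate over all other classes.
    """
    by_class = defaultdict(list)
    for line, cls in labeled_lines:
        by_class[cls].append(feature_vector(line))

    result = {}
    for cls, vectors in by_class.items():
        others = [v for c, vs in by_class.items() if c != cls for v in vs]

        def factor(name):
            inside = sum(v[name] for v in vectors) / len(vectors)
            outside = sum(v[name] for v in others) / len(others) if others else 0.0
            return inside - outside

        best = max(FEATURES, key=factor)
        result[cls] = (best, factor(best))
    return result


def tag_lines(lines, rules, default="paragraph"):
    """Tag each line with the class whose rule fires.

    A class's rule fires when its dominant feature is present in the line;
    if several fire, the one with the largest dominance factor wins.
    """
    tags = []
    for line in lines:
        vector = feature_vector(line)
        fired = [(f, cls) for cls, (feat, f) in rules.items() if vector[feat]]
        tags.append(max(fired)[1] if fired else default)
    return tags


if __name__ == "__main__":
    rules = dominant_features(TRAINING)
    document = "SUMMARY\n- an item\nSome closing text here.\n"
    print(tag_lines(extract_lines(document), rules))
```

In this sketch one rule per class is kept, matching the abstract's statement that each rule corresponds to one structural class; a fuller version could keep several dominant features per class and combine their factors.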
Public/Granted literature
- US20200034611A1 METHOD AND SYSTEM FOR DETERMINING STRUCTURAL BLOCKS OF A DOCUMENT (Public/Granted day: 2020-01-30)