Invention Grant
- Patent Title: Analyzing documents using stored templates
- Patent Title (中): 使用存储的模板分析文档
- Application No.: US12732278
- Application Date: 2010-03-26
- Publication No.: US08422786B2
- Publication Date: 2013-04-16
- Inventor: Vijil E. Chenthamarakshan, Rafah A. Hosn, Nandakishore Kambhatla, Debapriyo Majumdar, Shajith I. Mohamed, Soumitra Sarkar
- Applicant: Vijil E. Chenthamarakshan, Rafah A. Hosn, Nandakishore Kambhatla, Debapriyo Majumdar, Shajith I. Mohamed, Soumitra Sarkar
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Ference & Associates, LLC
- Main IPC: G06K9/34
- IPC: G06K9/34; G06K9/00

Abstract:
A method, a system and a computer program product for analyzing a document are disclosed. In response to receiving the document, the document is partitioned into a plurality of segments using a set of pre-defined attributes. The plurality of segments of the document is mapped with corresponding segments of at least one template selected from a set of stored templates. A first template from the set of stored templates is selected and a group of segments in the first template is identified by computing at least one of a structural similarity and a textual similarity associated with the group of segments compared with the plurality of segments of the document. A subset of segments from the group of segments is aligned with corresponding segments from the plurality of segments of the document. A set of scores is computed using a set of pre-defined criteria, in response to the mapping. The document is analyzed based on the computed set of scores.
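
The flow described in the abstract (partition, map against stored templates, align, score) can be illustrated with a minimal sketch. It is not the patented implementation. Several things here are assumptions made for illustration only: blank lines stand in for the unspecified "pre-defined attributes", `difflib.SequenceMatcher` stands in for the textual similarity measure, structural similarity is omitted, the greedy alignment and the 0.6 threshold are arbitrary choices, and the two scores (`coverage`, `mean_similarity`) are hypothetical examples of "pre-defined criteria". All function names are invented for this sketch.

```python
from difflib import SequenceMatcher


def partition(text):
    """Split text into segments. Blank lines are an assumed stand-in
    for the abstract's unspecified 'pre-defined attributes'."""
    return [block.strip() for block in text.split("\n\n") if block.strip()]


def textual_similarity(a, b):
    """Case-insensitive difflib ratio in [0, 1]; an assumed measure."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def align(doc_segments, template_segments, threshold=0.6):
    """Greedily pair each template segment with its most similar
    unused document segment, skipping pairs below the threshold."""
    pairs = []
    used = set()
    for t_idx, t_seg in enumerate(template_segments):
        best_idx, best_score = None, threshold
        for d_idx, d_seg in enumerate(doc_segments):
            if d_idx in used:
                continue
            score = textual_similarity(t_seg, d_seg)
            if score >= best_score:
                best_idx, best_score = d_idx, score
        if best_idx is not None:
            used.add(best_idx)
            pairs.append((t_idx, best_idx, best_score))
    return pairs


def score_document(text, templates):
    """Return hypothetical per-template scores for one document."""
    doc_segments = partition(text)
    scores = {}
    for name, template_text in templates.items():
        template_segments = partition(template_text)
        pairs = align(doc_segments, template_segments)
        # Fraction of template segments that found a document match.
        coverage = len(pairs) / len(template_segments) if template_segments else 0.0
        # Average similarity over the aligned pairs only.
        mean_sim = sum(s for _, _, s in pairs) / len(pairs) if pairs else 0.0
        scores[name] = {"coverage": coverage, "mean_similarity": mean_sim}
    return scores


# Hypothetical stored templates and an incoming document.
templates = {
    "nda": "Confidentiality\n\nTerm of agreement\n\nGoverning law",
    "invoice": "Invoice number\n\nAmount due",
}
document = "Confidentiality\n\nGoverning law"
scores = score_document(document, templates)
best_template = max(scores, key=lambda name: scores[name]["coverage"])
```

In this toy input the document matches two of the three segments in the hypothetical `nda` template and none in `invoice`, so the scores would flag the missing "Term of agreement" segment as something to review. A fuller treatment would also compare segment structure (order, nesting, headings), which the abstract names as an alternative or complement to textual similarity.
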
Public/Granted literature
- US20110235909A1 ANALYZING DOCUMENTS USING STORED TEMPLATES, Public/Granted day: 2011-09-29