Invention Grant
- Patent Title: Automated document processing system
- Patent Title (中): 自动文件处理系统
- Application No.: US11253305
- Application Date: 2005-10-19
- Publication No.: US08948511B2
- Publication Date: 2015-02-03
- Inventor: Daniel Ortega, Sherif Yacoub, Jose Abad Peiro, Paolo Faraboschi
- Applicant: Daniel Ortega, Sherif Yacoub, Jose Abad Peiro, Paolo Faraboschi
- Applicant Address: US TX Houston
- Assignee: Hewlett-Packard Development Company, L.P.
- Current Assignee: Hewlett-Packard Development Company, L.P.
- Current Assignee Address: US TX Houston
- Agency: Lee & Hayes, PLLC
- Agent: David S. Thompson
- Main IPC: G06K9/34
- IPC: G06K9/34; G06K9/00

Abstract:
An automated document processing system is configured to normalize zones obtained from a document, and to extract articles from the normalized zones. In one configuration, the system receives at least one zone from the document, and applies at least one zone-breaking factor, thereby creating normalized sub-zones within which text lines are consistent with the at least one zone-breaking factor. The normalized sub-zones may be evaluated to obtain a reading order. Adjacent sub-zones are joined if text similarity exceeds a threshold value. Weakly joined sub-zones are separated where indicated by a topic vector analysis of the weakly joined sub-zones.
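The zone-breaking and joining steps described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not the patented method: the zone-breaking factor is taken to be a per-line attribute such as `font_size`, text similarity is taken to be cosine similarity over word counts, and the names `break_zone`, `join_adjacent`, and the `threshold` default of 0.3 are hypothetical. The sub-zones are assumed to arrive already in reading order, and the topic-vector separation step is not covered.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercased word counts for a piece of text."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity of two word-count vectors (0.0 if either is empty)."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def break_zone(lines, key):
    """Split a zone (a list of text-line dicts) into sub-zones wherever the
    assumed zone-breaking factor `key` changes between consecutive lines."""
    sub_zones = []
    for line in lines:
        if sub_zones and sub_zones[-1][-1][key] == line[key]:
            sub_zones[-1].append(line)   # same factor value: extend sub-zone
        else:
            sub_zones.append([line])     # factor changed: start a new sub-zone
    return sub_zones


def join_adjacent(sub_zones, threshold=0.3):
    """Walk sub-zones in reading order and merge each one into the previous
    group when their text similarity exceeds `threshold` (assumed value)."""
    groups = []
    for zone in sub_zones:
        text = " ".join(line["text"] for line in zone)
        if groups and cosine_similarity(
            bag_of_words(groups[-1]["text"]), bag_of_words(text)
        ) > threshold:
            groups[-1]["text"] += " " + text
            groups[-1]["zones"].append(zone)
        else:
            groups.append({"text": text, "zones": [zone]})
    return groups
```

With a zone holding two headline lines and their body text, `break_zone(lines, "font_size")` would yield one sub-zone per run of equal font size, and `join_adjacent` would then group each headline with the body text that shares its vocabulary.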
Public/Granted literature:
- US20060274938A1 Automated document processing system, Public/Granted day: 2006-12-07