Invention Grant
US08625886B2 Finding repeated structure for data extraction from document images
有权
从文档图像中查找重复的数据提取结构
- Patent Title: Finding repeated structure for data extraction from document images
- Patent Title (中): 从文档图像中查找重复的数据提取结构
-
Application No.: US13022877Application Date: 2011-02-08
-
Publication No.: US08625886B2Publication Date: 2014-01-07
- Inventor: Evgeniy Bart , Prateek Sarkar , Eric Saund
- Applicant: Evgeniy Bart , Prateek Sarkar , Eric Saund
- Applicant Address: US CA Palo Alto
- Assignee: Palo Alto Research Center Incorporated
- Current Assignee: Palo Alto Research Center Incorporated
- Current Assignee Address: US CA Palo Alto
- Agency: Fay Sharpe LLP
- Main IPC: G06K9/62
- IPC: G06K9/62

Abstract:
Methods and system employing the same for finding repeated structure for data extraction from document images are provided. A reference record and one or more reference fields thereof are identified from a document image. One or more candidate fields are generated for each of the reference fields. One or more best candidate records from the candidate fields are selected using a probabilistic model and an optimal record set is determined from the best candidate records.
Public/Granted literature
- US20120201457A1 FINDING REPEATED STRUCTURE FOR DATA EXTRACTION FROM DOCUMENT IMAGES Public/Granted day:2012-08-09
Information query