Invention Grant
- Patent Title: Data extraction confidence attribute with transformations
- Patent Title (中): 具有变换的数据提取置信度属性
-
Application No.: US13180068Application Date: 2011-07-11
-
Publication No.: US08676731B1Publication Date: 2014-03-18
- Inventor: Vinaya Sathyanarayana , Peeta Basa Pati , Salaka Sivananda , Rajarajan T. R.
- Applicant: Vinaya Sathyanarayana , Peeta Basa Pati , Salaka Sivananda , Rajarajan T. R.
- Applicant Address: US CA Santa Ana
- Assignee: CoreLogic, Inc.
- Current Assignee: CoreLogic, Inc.
- Current Assignee Address: US CA Santa Ana
- Agency: Monument IP Law Group
- Main IPC: G06F15/18
- IPC: G06F15/18

Abstract:
A data extraction system for receiving and scanning documents to generate ordered input for storage in a database employs a non-linear statistical model for a data extraction sequence having a plurality of transformations. Each transformation transitions an extracted data value in various forms from a raw data image to a computed data value. For each transformation, a confidence model learns a confidence component for the particular transformation. The learned confidence components, generated from a control set of documents having known values, are employed in a production mode with actual raw data. The confidence component corresponds to a likelihood of transformation accuracy, and the confidence model aggregates the confidence components to compute a confidence for the extracted data value. A database stores the extracted data value labeled with the computed confidence attribute for subsequent use by an application employing the extracted data.
Information query