Invention Grant
- Patent Title: Scalable automatic data repair
- Application No.: US13115253
- Application Date: 2011-05-25
- Publication No.: US09619494B2
- Publication Date: 2017-04-11
- Inventor: Mohamed Yakout, Ahmed K. Elmagarmid, Laure Berti-Equille
- Applicant: Mohamed Yakout, Ahmed K. Elmagarmid, Laure Berti-Equille
- Applicant Address: QA Doha
- Assignee: QATAR FOUNDATION
- Current Assignee: QATAR FOUNDATION
- Current Assignee Address: QA Doha
- Agency: Mossman Kumar & Tyler PC
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A computer-implemented method for generating a set of updates for a database comprising multiple records that include erroneous, missing and inconsistent values. The method comprises: using a set of partitioning functions to subdivide the records of the database into multiple subsets of records; allocating respective ones of the records to at least one subset according to predetermined criteria for mapping records to subsets; applying multiple machine learning models to each of the subsets to determine respective candidate replacement values representing a tuple repair for a record, including a probability of candidate and current values for the record; and computing probabilities to select, from among the candidate replacement values, replacement values for the record which maximise the probability for values of the record for an updated database.
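The steps named in the abstract (partition, allocate, model per subset, select the most probable value) can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names, the sample records, the use of a simple co-occurrence frequency estimator in place of the "machine learning models", and the averaging of probabilities across subsets are all assumptions made for illustration.

```python
from collections import Counter, defaultdict

def partition(records, key_funcs):
    """Allocate each record index to one subset per partitioning function."""
    subsets = defaultdict(list)
    for i, rec in enumerate(records):
        for name, fn in key_funcs.items():
            subsets[(name, fn(rec))].append(i)
    return subsets

def fit_frequency_model(records, idxs, target, given):
    """Hypothetical stand-in for a learned model: estimates
    P(target | given) from co-occurrence counts inside one subset."""
    counts = defaultdict(Counter)
    for i in idxs:
        rec = records[i]
        if rec[target] is not None:
            counts[rec[given]][rec[target]] += 1
    return {
        g: {v: c / sum(cnt.values()) for v, c in cnt.items()}
        for g, cnt in counts.items()
    }

def propose_updates(records, key_funcs, target, given):
    """Return {record index: (attribute, replacement value, probability)}."""
    subsets = partition(records, key_funcs)
    scores = defaultdict(lambda: defaultdict(float))
    for idxs in subsets.values():
        model = fit_frequency_model(records, idxs, target, given)
        for i in idxs:
            for value, p in model.get(records[i][given], {}).items():
                scores[i][value] += p
    updates = {}
    n_models = len(key_funcs)  # each record falls in one subset per function
    for i, by_value in scores.items():
        # Assumption: combine the per-subset estimates by simple averaging.
        avg = {v: s / n_models for v, s in by_value.items()}
        best = max(avg, key=avg.get)
        if best != records[i][target]:
            updates[i] = (target, best, avg[best])
    return updates

# Made-up sample data: record 2 is inconsistent, record 3 has a missing value.
records = [
    {"zip": "47906", "city": "West Lafayette", "state": "IN"},
    {"zip": "47906", "city": "West Lafayette", "state": "IN"},
    {"zip": "47906", "city": "W Lafayette", "state": "IN"},
    {"zip": "47906", "city": None, "state": "IN"},
    {"zip": "10001", "city": "New York", "state": "NY"},
]
key_funcs = {
    "by_state": lambda r: r["state"],
    "by_zip_prefix": lambda r: r["zip"][:3],
}
updates = propose_updates(records, key_funcs, target="city", given="zip")
for idx, (attr, value, prob) in sorted(updates.items()):
    print(idx, attr, value, round(prob, 3))
```

With this sample data, only records 2 and 3 receive a proposed update; the records whose current value is already the most probable one are left unchanged. Partitioning keeps each model's training set small, which is the property the title's "scalable" refers to, though the way the per-subset estimates are combined here is only one possible choice.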
Public/Granted literature
- US20120303555A1 Scalable Automatic Data Repair Public/Granted day: 2012-11-29