Invention Grant
- Patent Title: Active learning for data matching
-
Application No.: US16859107Application Date: 2020-04-27
-
Publication No.: US11409772B2Publication Date: 2022-08-09
- Inventor: Lars Bremer , Utkarsh Bajpai , Martin Oberhofer , Alexandre Luz Xavier Da Costa
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Kelli D. Morin
- Priority: EP19189991 20190805
- Main IPC: G06F16/28
- IPC: G06F16/28 ; G06N20/00

Abstract:
A method includes training a machine learning model using a current set of labeled data points. Each of the data points is multiple data records. A label of a data point indicates a classification of the data point. The training results in a trained machine learning model configured to classify a data point as representing a same entity or different entities. The method includes selecting a subset of unlabeled data points from a current set of unlabeled data points using classification results of the current set of unlabeled data points. The method includes providing the subset of unlabeled data points to a classifier and in response to providing receiving labels of the subset of unlabeled data points. The method may be repeated using the subset of labeled data points in addition to the current set of labeled data points as the current set of labeled data points.
Public/Granted literature
- US20210042330A1 ACTIVE LEARNING FOR DATA MATCHING Public/Granted day:2021-02-11
Information query