Invention Grant
- Patent Title: Discovery of linkage points between data sources
-
Application No.: US16794895Application Date: 2020-02-19
-
Publication No.: US11531717B2Publication Date: 2022-12-20
- Inventor: Oktie Hassanzadeh , Mauricio A. Hernandez-Sherrington , Ching-Tien Ho , Lucian Popa
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agent Michael O'Keefe
- Main IPC: G06F16/9535
- IPC: G06F16/9535 ; G06F16/25 ; G06F16/27 ; G06F16/2457

Abstract:
Data records are linked across a plurality of datasets. Each dataset contains at least one data record, and each data record is associated with an entity and includes one or more attributes of that entity and a value for each attribute. Values associated with attributes are compared across datasets, and matching attributes having values that satisfy a predetermined similarity threshold are identified. In addition, linkage points between pairs of datasets are identified. Each linkage point links one or more pairs of data records. Each data record in the pair of data records is contained in one of a given pair of datasets, and each pair of data records is associated with a common entity having matching attributes in the given pair of datasets. Data records associated with the common entities are linked across datasets using the identified linkage points.
Public/Granted literature
- US20200183995A1 DISCOVERY OF LINKAGE POINTS BETWEEN DATA SOURCES Public/Granted day:2020-06-11
Information query