Invention Grant
- Patent Title: Method to automatically join datasets with different geographic location naming conventions
- Application No.: US16785314
- Application Date: 2020-02-07
- Publication No.: US11243954B2
- Publication Date: 2022-02-08
- Inventor: Lin Luo, Changying Sun, Graham Wills, Mohammed Mostafa
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Yee & Associates, P.C.
- Main IPC: G06F17/00
- IPC: G06F17/00; G06F16/2455; G06F16/28

Abstract:
A computer-implemented method for joining datasets with mismatched geographic location naming conventions is provided. The method includes identifying, by the computer, a first dataset and a second dataset as join candidates. The method also includes joining, by the computer, the first dataset and the second dataset when each row of the first user dataset is associated with a single geographic identifier using a geographic knowledge dataset that includes a geographic name lookup table, and each row of the second user dataset is associated with a single geographic identifier using the geographic knowledge dataset, wherein the geographic name lookup table includes a plurality of alias names for each of a plurality of unique geographic locations.
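The join condition described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The lookup table contents, the identifier format (`US-NY`), the function names, and the sample rows are all hypothetical, and the sketch assumes each alias maps to exactly one location, which real place names do not always satisfy.

```python
# Hypothetical geographic name lookup table: several alias names
# map to one unique geographic identifier (illustrative values only).
ALIAS_TO_GEO_ID = {
    "new york": "US-NY",
    "ny": "US-NY",
    "n.y.": "US-NY",
    "california": "US-CA",
    "ca": "US-CA",
    "calif.": "US-CA",
}


def normalize(name):
    """Lowercase the name and collapse whitespace before lookup."""
    return " ".join(name.strip().lower().split())


def resolve_rows(rows, name_field):
    """Pair every row with its geographic identifier.

    Returns None if any row cannot be resolved, because the join is
    only performed when each row maps to a single identifier.
    """
    resolved = []
    for row in rows:
        geo_id = ALIAS_TO_GEO_ID.get(normalize(row[name_field]))
        if geo_id is None:
            return None
        resolved.append((geo_id, row))
    return resolved


def join_on_geography(first, first_field, second, second_field):
    """Inner-join two lists of dict rows on the resolved identifier."""
    left = resolve_rows(first, first_field)
    right = resolve_rows(second, second_field)
    if left is None or right is None:
        return None  # join condition not met

    # Index the second dataset by identifier for fast matching.
    right_by_id = {}
    for geo_id, row in right:
        right_by_id.setdefault(geo_id, []).append(row)

    joined = []
    for geo_id, left_row in left:
        for right_row in right_by_id.get(geo_id, []):
            joined.append({"geo_id": geo_id, **left_row, **right_row})
    return joined


# Illustrative datasets that name the same places differently.
first = [{"state": "New York", "sales": 10}, {"state": "Calif.", "sales": 20}]
second = [{"region": "NY", "stores": 3}, {"region": "CA", "stores": 5}]
joined = join_on_geography(first, "state", second, "region")
```

Here `"New York"` and `"NY"` resolve to the same identifier, so the two rows are joined even though the raw strings differ. A dataset containing a name absent from the table makes `join_on_geography` return `None` instead of producing a partial join.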
Public/Granted literature
- US20210248137A1 METHOD TO AUTOMATICALLY JOIN DATASETS WITH DIFFERENT GEOGRAPHIC LOCATION NAMING CONVENTIONS Public/Granted day: 2021-08-12