Method to automatically join datasets with different geographic location naming conventions
Abstract:
A computer-implemented method for joining data sets with mismatched geographic location naming conventions is provided. The method includes identifying, by the computer, a first dataset and a second dataset as join candidates. The method also includes joining, by the computer, the first dataset and the second dataset when, each row of the first user dataset is associated with a single geographic identifier using a geographic knowledge dataset that includes a geographic name lookup table and each row of the second user dataset is associated with a single geographic identifier using the geographic knowledge dataset, wherein the geographic name lookup table includes a plurality of alias names for each of a plurality of unique geographic locations.
Information query
Patent Agency Ranking
0/0