Method, system, and computer program product for managing source identifiers of clustered records
Abstract:
Provided is a method for managing source identifiers of clustered records, which may include receiving a plurality of records, each having identification data associated with a source thereof. The records may be sorted into a first plurality of clusters. A first source identifier may be generated for each cluster. Update data associated with an update to the records may be received. At least some of the records may be sorted into a second plurality of clusters. A first cluster of the second plurality of clusters may have first identification data different from at least one cluster of the first plurality of clusters, and a second cluster of the second plurality of clusters may have identification data that substantially matches the at least one cluster. Whether to assign the first source identifier to the first or second cluster may be determined. A system and computer program product are also disclosed.
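The flow described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the disclosure: the function names (`cluster_records`, `assign_identifiers`, `reassign_identifiers`), the `SRC-n` identifier format, clustering by normalized text, and the use of a string-similarity threshold of 0.8 to decide what "substantially matches" means.

```python
from difflib import SequenceMatcher
from itertools import count


def cluster_records(records):
    """Group record ids by normalized identification data (hypothetical rule)."""
    clusters = {}
    for rec_id, ident in records.items():
        key = " ".join(ident.lower().split())  # lowercase, collapse whitespace
        clusters.setdefault(key, set()).add(rec_id)
    return clusters


def assign_identifiers(clusters, id_counter):
    """Generate a fresh source identifier for every cluster."""
    return {key: f"SRC-{next(id_counter)}" for key in sorted(clusters)}


def reassign_identifiers(old_clusters, old_ids, new_clusters, id_counter,
                         threshold=0.8):
    """Carry each old identifier over to the best-matching new cluster.

    Assumption: among new clusters that share a record with the old cluster,
    the one whose identification data is most similar inherits the identifier,
    provided the similarity reaches `threshold`. All others get new identifiers.
    """
    new_ids = {}
    for old_key, old_members in old_clusters.items():
        candidates = [
            key for key, members in new_clusters.items()
            if members & old_members and key not in new_ids
        ]
        if not candidates:
            continue
        best = max(candidates,
                   key=lambda k: SequenceMatcher(None, old_key, k).ratio())
        if SequenceMatcher(None, old_key, best).ratio() >= threshold:
            new_ids[best] = old_ids[old_key]
    for key in sorted(new_clusters):
        if key not in new_ids:
            new_ids[key] = f"SRC-{next(id_counter)}"
    return new_ids


# Made-up example data: record 2 changes its identification data in the update.
id_counter = count(1)
records = {1: "Acme Coffee", 2: "acme  coffee", 3: "Bob's Diner"}
first_clusters = cluster_records(records)
first_ids = assign_identifiers(first_clusters, id_counter)

updated = dict(records)
updated[2] = "Acme Tea House"
second_clusters = cluster_records(updated)
second_ids = reassign_identifiers(first_clusters, first_ids,
                                  second_clusters, id_counter)
```

In this example the original "acme coffee" cluster splits in two after the update. The new cluster whose identification data still matches keeps the original identifier, and the cluster with different identification data receives a new one. A real system could weigh other signals, such as cluster size or record history, when making that determination.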