Invention Grant
US09122639B2 Detection and deduplication of backup sets exhibiting poor locality 有权
备份集的检测和重复数据删除表现不佳

Detection and deduplication of backup sets exhibiting poor locality
Abstract:
Described are computer-based methods and apparatuses, including computer program products, for detection and deduplication of backup sets exhibiting poor locality. A first set of summaries of a first data set are determined, each summary of the first set of summaries being indicative of a data pattern in the first data set. A second set of summaries of a second data set are determined, each summary of the second set of summaries being indicative of a data pattern in the second data set. A set of comparison metrics are calculated, each comparison metric being based on a first subset of summaries from the first set of summaries and a second subset of summaries from the second set of summaries. A locality metric is calculated based on the set of comparison metrics indicative of whether the first data set and second data set exhibit poor locality.
Information query
Patent Agency Ranking
0/0