Combining data matches from multiple sources in a deduplication storage system
Abstract:
Embodiments for combining input data matches in data deduplication of input data by a processor. Matches of input data are calculated using a plurality of independent deduplication processes referencing a plurality of repository data segments for the input data. A combined list of output data matches is calculated by removing those of the input data matches that are fully enclosed within other input data matches; and removing those of the input data matches determined to be smaller than a predetermined threshold for citing. A deduplication operation is performed on the combined list of output data matches. Each pair of the input data matches having an overlap section is processed in an ascending order of a position.
Information query
Patent Agency Ranking
0/0