Invention Grant
- Patent Title: Apparatus and method to deduplicate data
- Patent Title (中): 用于重复数据删除的设备和方法
-
Application No.: US12404988Application Date: 2009-03-16
-
Publication No.: US08346736B2Publication Date: 2013-01-01
- Inventor: Nils Haustein , Craig Anthony Klein , Stephen Leonard Schwartz , Daniel James Winarski
- Applicant: Nils Haustein , Craig Anthony Klein , Stephen Leonard Schwartz , Daniel James Winarski
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Quarles & Brady LLP
- Agent Dale F. Regelman
- Main IPC: G06F7/00
- IPC: G06F7/00

Abstract:
A method to deduplicate data by receiving a data set, setting a data chunk size, selecting a first stage deduplication algorithm, and selecting a second stage deduplication algorithm, where the first stage deduplication algorithm differs from the second stage deduplication algorithm. The method selects a data chunk, where that data chunk comprises all or a portion of the data set, performs a first stage deduplication analysis of the data chunk using the first stage deduplication algorithm. If the first stage deduplication analysis indicates duplicate data, then the method performs a second state deduplication analysis of said data chunk using the second stage deduplication algorithm to verify the data as duplicate. Only if both data deduplication analysis indicate duplicate data the data chunk is replaced by a deduplication stub or reference to the identical data chunk which is already stored.
Public/Granted literature
- US20100235332A1 APPARATUS AND METHOD TO DEDUPLICATE DATA Public/Granted day:2010-09-16
Information query