Invention Grant
- Patent Title: Tunable data fingerprinting for optimizing data deduplication
- Patent Title (中): 可调整的数据指纹识别,用于优化重复数据删除
-
Application No.: US12113136Application Date: 2008-04-30
-
Publication No.: US08620877B2Publication Date: 2013-12-31
- Inventor: Mark Andrew Smith
- Applicant: Mark Andrew Smith
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Leonard T. Guzman; Mohammed Kashef
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/00

Abstract:
The present invention provides a method and system of performing de-duplication for at least one computer file in a computer system. In an exemplary embodiment, the method and system include (1) tuning a rolling-hash algorithm for the de-duplication, (2) chunking the data in the file into chunks of data by using the tuned algorithm, (3) producing a content identifier for each of the chunks, and (4) processing the chunks that are unique, the content identifier for each of the chunks that are unique, and references to the chunks that are unique. In an exemplary embodiment, the computer system includes a de-duplication-enabled data store. In an exemplary embodiment, the computer system includes (a) a transferor computer system that is configured to transfer the file to a de-duplication-enabled computer system and (b) the de-duplication-enabled computer system.
Public/Granted literature
- US20090276454A1 PERFORMING DE-DUPLICATION FOR AT LEAST ONE COMPUTER FILE IN A COMPUTER SYSTEM Public/Granted day:2009-11-05
Information query