Invention Grant
- Patent Title: Global de-duplication in shared architectures
- Patent Title (中): 共享架构中的全局重复数据删除
-
Application No.: US13464017Application Date: 2012-05-04
-
Publication No.: US08423726B2Publication Date: 2013-04-16
- Inventor: Jedidiah Yueh
- Applicant: Jedidiah Yueh
- Applicant Address: US MA Hopkinton
- Assignee: EMC Corporation
- Current Assignee: EMC Corporation
- Current Assignee Address: US MA Hopkinton
- Agency: Workman Nydegger
- Main IPC: G06F12/00
- IPC: G06F12/00 ; G06F13/00 ; G06F13/28

Abstract:
Redundant data is globally de-duplicated across a shared architecture that includes a plurality of storage systems. The storage systems implement copy-on-write or WAFL to generate snapshots of original data. Each storage system includes a de-duplication client to identify and reduce redundant original and/or snapshot data on the storage system. Each de-duplication client can de-duplicate a digital sequence by breaking the sequence into blocks and identifying redundant blocks already stored in the shared architecture. Identifying redundant blocks may include hashing each block and comparing the hash to a local and/or master hash table containing hashes of existing data. Once identified, redundant data previously stored is deleted (e.g., post-process de-duplication), or redundant data is not stored to begin with (e.g., inline de-duplication). In both cases, pointers to shared data blocks can be used to reassemble the digital sequence where one or more blocks were deleted or not stored on the storage system.
Public/Granted literature
- US20120221817A1 GLOBAL DE-DUPLICATION IN SHARED ARCHITECTURES Public/Granted day:2012-08-30
Information query