Invention Grant
- Patent Title: Block level deduplication with block similarity
-
Application No.: US15096410Application Date: 2016-04-12
-
Publication No.: US10783145B2Publication Date: 2020-09-22
- Inventor: Vitaly Stanislavovitch Kozlovsky , Aleksandr Valentinovich Shadrin , Inga Sergeyevna Petryaevskaya
- Applicant: EMC IP Holding Company LLC
- Applicant Address: US MA Hopkinton
- Assignee: EMC IP Holding Company LLC
- Current Assignee: EMC IP Holding Company LLC
- Current Assignee Address: US MA Hopkinton
- Agency: Ryan, Mason & Lewis, LLP
- Priority: com.zzzhc.datahub.patent.etl.us.BibliographicData$PriorityClaim@34055254
- Main IPC: G06F17/00
- IPC: G06F17/00 ; G06F16/2455 ; G06F16/2457 ; G06F16/174

Abstract:
Methods and apparatus are provided for block similarity based block level deduplication of data. An exemplary method comprises obtaining a deduplicated dataset comprising a plurality of unique data chunks; determining a number of differences between two of the unique data chunks; evaluating whether the number of differences satisfies a predefined similarity criteria (e.g., that the number of bit differences cannot exceed a specified limit); and storing metadata for a first one of the two unique data chunks if the predefined similarity criteria is satisfied for the two unique data chunks, wherein the metadata comprises a pointer to a second one of the two unique data chunks and bit differences between the two unique data chunks. The bit differences comprise an executable code and/or a bit mask. The predefined similarity threshold is optionally a tunable parameter. The first one of the two unique data chunks can be restored by processing the metadata.
Public/Granted literature
- US20170083581A1 Block Level Deduplication with Block Similarity Public/Granted day:2017-03-23
Information query