Invention Grant
- Patent Title: Providing data deduplication in a data storage system with parallelized computation of crypto-digests for blocks of host I/O data
- Application No.: US16450390
- Application Date: 2019-06-24
- Publication No.: US10936228B2
- Publication Date: 2021-03-02
- Inventor: Istvan Gonczi, Ivan Bassov, Philippe Armangau
- Applicant: EMC IP Holding Company LLC
- Applicant Address: US MA Hopkinton
- Assignee: EMC IP Holding Company LLC
- Current Assignee: EMC IP Holding Company LLC
- Current Assignee Address: US MA Hopkinton
- Agency: BainwoodHuang
- Main IPC: G06F3/06
- IPC: G06F3/06; H04L9/06; G06F12/0891; G06F15/80; H04L9/08

Abstract:
In response to a cache flush event indicating that host data accumulated in a cache of a storage processor of a data storage system is to be flushed to a lower deck file system, an aggregation set of blocks is formed within the cache, and a digest calculation group is selected from within the aggregation set. Hardware vector processing logic is caused to simultaneously calculate crypto-digests from the blocks in the digest calculation group. If one of the resulting crypto-digests matches a previously generated crypto-digest, deduplication is performed that i) causes the lower deck file system to indicate the block of data from which the previously generated crypto-digest was generated and ii) discards the block that corresponds to the matching crypto-digest. Objects required by a digest generation component may be allocated in a just in time manner to avoid having to manage a pool of pre-allocated objects.
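The flush-time flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: a thread pool stands in for the hardware vector processing logic, SHA-256 stands in for the unspecified crypto-digest, and the names `GROUP_SIZE`, `compute_digests`, `flush`, `digest_index`, and `storage` are all hypothetical.

```python
import hashlib
from concurrent.futures import ThreadPoolExecutor

GROUP_SIZE = 4  # hypothetical size of a digest calculation group


def compute_digests(group):
    """Digest every block in the group at once.

    Assumption: a thread pool is only a stand-in for the hardware
    vector processing logic named in the abstract.
    """
    with ThreadPoolExecutor(max_workers=len(group)) as pool:
        return list(pool.map(lambda b: hashlib.sha256(b).digest(), group))


def flush(aggregation_set, digest_index, storage):
    """Flush cached blocks, deduplicating against earlier digests.

    digest_index maps a digest to the storage position of the block
    that produced it. Returns one storage position per flushed block.
    """
    mapping = []
    for start in range(0, len(aggregation_set), GROUP_SIZE):
        # Select the next digest calculation group from the aggregation set.
        group = aggregation_set[start:start + GROUP_SIZE]
        for block, digest in zip(group, compute_digests(group)):
            if digest in digest_index:
                # Match: point at the existing block and discard this one.
                mapping.append(digest_index[digest])
            else:
                # No match: store the block and remember its digest.
                storage.append(block)
                digest_index[digest] = len(storage) - 1
                mapping.append(digest_index[digest])
    return mapping
```

For example, flushing five blocks of which two repeat earlier content stores only three blocks, and the returned mapping points each duplicate at the position of the block first seen with that digest.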