Back-reference data structure for a deduplication storage system

    公开(公告)号:US12130707B2

    公开(公告)日:2024-10-29

    申请号:US18185202

    申请日:2023-03-16

    CPC classification number: G06F11/1453 G06F2201/84

    Abstract: Example implementations relate to deduplication operations in a storage system. An example includes generating a housekeeping work map to delete a backup item stored in a deduplication storage system; selecting a first work entry of the housekeeping work map, where the first work entry identifies a first container index and a first manifest; in response to a selection of the first work entry, loading the first container index into the memory, the first container index comprising a back-reference data structure; identifying, in the back-reference data structure, a back-reference entry indexed to the first manifest; determining, using the back-reference entry indexed to the first manifest, a first set of data units included in the first manifest and that are indexed in the first container index; and decrementing, in the first container index, a set of reference counts for the determined first set of data units.

    INLINE HOUSEKEEPING FOR A DEDUPLICATION STORAGE SYSTEM

    公开(公告)号:US20240362122A1

    公开(公告)日:2024-10-31

    申请号:US18307960

    申请日:2023-04-27

    CPC classification number: G06F11/1453 G06F11/1461 G06F2201/84

    Abstract: Example implementations relate to deduplication operations in a storage system. An example includes receiving a stream of data units to be stored in a persistent storage of a deduplication storage system; loading a first container index into the memory to index one or more data units included in the stream; updating the first container index to indicate a storage location of a first data unit included in the stream; reading a manifest list in the first container index to identify a first manifest indexed by the first container index; and, while the first container index remains loaded in the memory to index the one or more data units, in response to a determination that the identified first manifest is included in the first backup item that has been marked for deletion, decrementing a reference count of a second data unit.

    BACK-REFERENCE DATA STRUCTURE FOR A DEDUPLICATION STORAGE SYSTEM

    公开(公告)号:US20240311255A1

    公开(公告)日:2024-09-19

    申请号:US18185202

    申请日:2023-03-16

    CPC classification number: G06F11/1453 G06F2201/84

    Abstract: Example implementations relate to deduplication operations in a storage system. An example includes generating a housekeeping work map to delete a backup item stored in a deduplication storage system; selecting a first work entry of the housekeeping work map, where the first work entry identifies a first container index and a first manifest; in response to a selection of the first work entry, loading the first container index into the memory, the first container index comprising a back-reference data structure; identifying, in the back-reference data structure, a back-reference entry indexed to the first manifest; determining, using the back-reference entry indexed to the first manifest, a first set of data units included in the first manifest and that are indexed in the first container index; and decrementing, in the first container index, a set of reference counts for the determined first set of data units.

    Matching operation for a deduplication storage system

    公开(公告)号:US12061581B2

    公开(公告)日:2024-08-13

    申请号:US17814943

    申请日:2022-07-26

    CPC classification number: G06F16/215 G06F16/245

    Abstract: Example implementations relate to metadata operations in a storage system. An example includes generating, by a storage controller of a deduplication storage system, a candidate list of container indexes for matching operations of a received data segment, each container index in the candidate list having an associated match cost; identifying, by the storage controller, a journal group associated with a first container index listed in the candidate list; reducing, by the storage controller, a match cost associated with the first container index in response to a determination that the identified journal group is in a modified state; and performing, by the storage controller, the matching operations of the received data segment based at least on the reduced match cost of the first container index.

    HOUSEKEEPING WORK MAP FOR A DEDUPLICATION STORAGE SYSTEM

    公开(公告)号:US20240311363A1

    公开(公告)日:2024-09-19

    申请号:US18185184

    申请日:2023-03-16

    CPC classification number: G06F16/2365 G06F16/2272

    Abstract: Example implementations relate to deduplication operations in a storage system. An example includes initiating a housekeeping job to delete a backup item stored in a deduplication storage system; identifying a set of container indexes affected by the housekeeping job, the set of container indexes including at least a first container index; in response to a determination that a housekeeping work map includes a first work entry associated with the first container index, determining an ingest count of new data units added to a first container during an ingest of the backup item into the deduplication storage system, the first container associated with the first container index; and adding the determined ingest count to the cumulative counter included in the first work entry associated with the first container index.

    MATCHING OPERATION FOR A DEDUPLICATION STORAGE SYSTEM

    公开(公告)号:US20240037078A1

    公开(公告)日:2024-02-01

    申请号:US17814943

    申请日:2022-07-26

    CPC classification number: G06F16/215 G06F16/245

    Abstract: Example implementations relate to metadata operations in a storage system. An example includes generating, by a storage controller of a deduplication storage system, a candidate list of container indexes for matching operations of a received data segment, each container index in the candidate list having an associated match cost; identifying, by the storage controller, a journal group associated with a first container index listed in the candidate list; reducing, by the storage controller, a match cost associated with the first container index in response to a determination that the identified journal group is in a modified state; and performing, by the storage controller, the matching operations of the received data segment based at least on the reduced match cost of the first container index.

Patent Agency Ranking