- Patent Title: System and method for efficiently measuring physical space for an ad-hoc subset of files in protection storage filesystem with stream segmentation and data deduplication
-
Application No.: US16380815Application Date: 2019-04-10
-
Publication No.: US11269817B2Publication Date: 2022-03-08
- Inventor: Guilherme Menezes , Fabiano Botelho , Abdullah Reza
- Applicant: EMC IP Holding Company LLC
- Applicant Address: US MA Hopkinton
- Assignee: EMC IP Holding Company LLC
- Current Assignee: EMC IP Holding Company LLC
- Current Assignee Address: US MA Hopkinton
- Agency: Workman Nydegger
- Main IPC: G06F16/00
- IPC: G06F16/00 ; G06F16/182 ; G06F11/14 ; G06F16/11 ; G06F16/9535 ; G06F16/174

Abstract:
In one example, a method includes measuring an amount of physical storage space used, or expected to be used, by a portion of a dataset S of segments, and measuring the amount of physical storage space includes receiving information that identifies an ad-hoc group of size ‘n’ of files F1 . . . Fn that makes up a subset of the dataset S, determining a number of unique segments in the dataset S, identifying a respective unique segment set UF1 . . . UFN for each of the ‘n’ files in the ad-hoc group of files, performing a set union operation on the unique segment sets UF1 . . . UFN, and determining a sum of sizes of the unique segment sets UF1 . . . UFN, where the sum is the amount of physical storage space used or expected to be used by the ad-hoc group of size ‘n’ of files F1 . . . Fn.
Public/Granted literature
Information query