System and method for estimating storage savings from deduplication
    1.
    发明授权
    System and method for estimating storage savings from deduplication 有权
    用于估算重复数据删除存储节省的系统和方法

    公开(公告)号:US09152333B1

    公开(公告)日:2015-10-06

    申请号:US13768191

    申请日:2013-02-15

    Applicant: NetApp, Inc.

    CPC classification number: G06F3/0641 G06F3/0608 G06F3/067 G06F3/0673

    Abstract: Techniques for a method of estimating deduplication potential are disclosed herein. The method includes steps of selecting randomly a plurality of data blocks from a data set as a sample of the data set, collecting fingerprints of the plurality of data blocks of the sample, identifying duplicates of fingerprints of the sample from the fingerprints of the plurality of data blocks, estimating a total number of unique fingerprints of the data set depending on a total number of the duplicates of fingerprints of the sample based on a probability of fingerprints from the data set colliding in the sample, and determining a total number of duplicates of fingerprints of the data set depending on the total number of the unique fingerprints of the data set.

    Abstract translation: 本文中公开了一种估算重复数据消除潜力的方法。 该方法包括以下步骤:从作为数据集的样本的数据集中随机选择多个数据块,收集样本的多个数据块的指纹,从多个数据集的指纹中识别样本的指纹的重复 数据块,基于来自与样本相冲突的数据集的指纹的概率,根据所述样本的指纹的副本的总数来估计所述数据集的唯一指纹的总数,并且确定所述样本的副本的总数 取决于数据集的唯一指纹的总数的数据集的指纹。

Patent Agency Ranking