Locality aware, two-level fingerprint caching
    1.
    发明授权
    Locality aware, two-level fingerprint caching 有权
    局部感知,二级指纹缓存

    公开(公告)号:US09495379B2

    公开(公告)日:2016-11-15

    申请号:US13646852

    申请日:2012-10-08

    CPC classification number: G06F17/30156 G06F11/1453 G06F2201/81 H04L67/2847

    Abstract: The present disclosure provides for implementing a two-level fingerprint caching scheme for a client cache and a server cache. The client cache hit ratio can be improved by pre-populating the client cache with fingerprints that are relevant to the client. Relevant fingerprints include fingerprints used during a recent time period (e.g., fingerprints of segments that are included in the last full backup image and any following incremental backup images created for the client after the last full backup image), and thus are referred to as fingerprints with good temporal locality. Relevant fingerprints also include fingerprints associated with a storage container that has good spatial locality, and thus are referred to as fingerprints with good spatial locality. A pre-set threshold established for the client cache (e.g., threshold Tc) is used to determine whether a storage container (and thus fingerprints associated with the storage container) has good spatial locality.

    Abstract translation: 本公开提供了实现用于客户机高速缓存和服务器高速缓存的两级指纹高速缓存方案。 可以通过使用与客户端相关的指纹预先填充客户端缓存来提高客户端缓存命中率。 相关指纹包括在最近一段时间内使用的指纹(例如,最后一个完整备份图像中包含的片段的指纹以及在最后一个完整备份图像之后为客户端创建的任何后续增量备份映像),因此被称为指纹 具有良好的时间局部性。 相关指纹还包括与具有良好空间局部性的存储容器相关联的指纹,因此被称为具有良好空间局部性的指纹。 为客户端缓存建立的预设阈值(例如,阈值Tc)用于确定存储容器(以及因此与存储容器相关联的指纹)是否具有良好的空间局部性。

    Autonomous deduplication for efficient recovery for a deduplication system

    公开(公告)号:US09619479B1

    公开(公告)日:2017-04-11

    申请号:US14316632

    申请日:2014-06-26

    CPC classification number: G06F17/30156

    Abstract: A method to partition a deduplication pool is provided. The method includes determining that an amount of data in a plurality of data containers of the deduplication pool has reached a data capacity threshold and comparing each data container of the plurality of data containers with at least one other of the plurality of data containers as to amount of shared data. The method includes grouping, based on results of the comparing, the plurality of data containers into a plurality of groups of data containers, with data sharing from each of the plurality of groups of data containers to each other of the plurality of groups of data containers less than a data sharing threshold and data sharing inside each of the plurality of groups of data containers greater than the data sharing threshold.

    Data locality control for deduplication
    3.
    发明授权
    Data locality control for deduplication 有权
    重复数据删除的数据区域控制

    公开(公告)号:US09367559B1

    公开(公告)日:2016-06-14

    申请号:US14094527

    申请日:2013-12-02

    Abstract: A method for data locality control in a deduplication system is provided. The method includes forming a fingerprint cache from a backup image corresponding to a first backup operation. The method includes removing one or more fingerprints from inclusion in the fingerprint cache, in response to the one or more fingerprints having a data segment locality, in a container, less than a threshold of data segment locality. The container has one or more data segments corresponding to the one or more fingerprints. The method includes applying the fingerprint cache, with the one or more fingerprints removed from inclusion therein, to a second backup operation, wherein at least one method operation is executed through a processor.

    Abstract translation: 提供了一种重复数据删除系统中的数据位置控制方法。 该方法包括从对应于第一备份操作的备份映像形成指纹高速缓存。 响应于容器中具有数据段位置的一个或多个指纹小于数据段位置的阈值,该方法包括移除指纹高速缓存中包含的一个或多个指纹。 容器具有与一个或多个指纹对应的一个或多个数据段。 该方法包括将指纹高速缓冲存储器中的一个或多个指纹从其包含的内容中删除到第二备份操作,其中通过处理器执行至少一个方法操作。

    Autonomous container group management for scale out deduplication system

    公开(公告)号:US09626253B1

    公开(公告)日:2017-04-18

    申请号:US14316636

    申请日:2014-06-26

    Abstract: A method for data container group management in a deduplication system is provided. The method includes arranging a plurality of data container groups according to a plurality of file systems. A subset of the plurality of data container groups correspond to each of the plurality of file systems, each of the plurality of data container groups having a reference database, a plurality of data containers, and a data container group identifier (ID). The method includes performing a first backup process for a first client-policy pair with deduplication via a first one of the plurality of data container groups and performing a second backup process for a second client-policy pair with deduplication via a second one of the plurality of data container groups.

    DEDUPLICATION STORAGE SYSTEM WITH EFFICIENT REFERENCE UPDATING AND SPACE RECLAMATION
    5.
    发明申请
    DEDUPLICATION STORAGE SYSTEM WITH EFFICIENT REFERENCE UPDATING AND SPACE RECLAMATION 审中-公开
    具有有效参考更新和空间重新定位的重复存储系统

    公开(公告)号:US20140244599A1

    公开(公告)日:2014-08-28

    申请号:US13774083

    申请日:2013-02-22

    Abstract: A deduplication storage system and associated methods are described. The deduplication storage system may split data objects into segments and store the segments. A plurality of data segment containers may be maintained. Each of the containers may include two or more of the data segments. Maintaining the containers may include maintaining a respective logical size of each container. In response to detecting that the logical size of a particular container has fallen below a threshold level, the deduplication storage system may perform an operation to reclaim the storage space allocated to one or more of the data segments included in the particular container.

    Abstract translation: 描述重复数据删除存储系统及相关方法。 重复数据消除存储系统可能会将数据对象分割成段并存储段。 可以维护多个数据段容器。 每个容器可以包括两个或更多个数据段。 维护容器可以包括保持每个容器的相应逻辑尺寸。 响应于检测到特定容器的逻辑大小已经低于阈值水平,重复数据消除存储系统可以执行操作以回收分配给包括在特定容器中的一个或多个数据段的存储空间。

    Locality Aware, Two-Level Fingerprint Caching
    6.
    发明申请
    Locality Aware, Two-Level Fingerprint Caching 有权
    局部感知,两级指纹缓存

    公开(公告)号:US20140101113A1

    公开(公告)日:2014-04-10

    申请号:US13646852

    申请日:2012-10-08

    CPC classification number: G06F17/30156 G06F11/1453 G06F2201/81 H04L67/2847

    Abstract: The present disclosure provides for implementing a two-level fingerprint caching scheme for a client cache and a server cache. The client cache hit ratio can be improved by pre-populating the client cache with fingerprints that are relevant to the client. Relevant fingerprints include fingerprints used during a recent time period (e.g., fingerprints of segments that are included in the last full backup image and any following incremental backup images created for the client after the last full backup image), and thus are referred to as fingerprints with good temporal locality. Relevant fingerprints also include fingerprints associated with a storage container that has good spatial locality, and thus are referred to as fingerprints with good spatial locality. A pre-set threshold established for the client cache (e.g., threshold Tc) is used to determine whether a storage container (and thus fingerprints associated with the storage container) has good spatial locality.

    Abstract translation: 本公开提供了实现用于客户机高速缓存和服务器高速缓存的两级指纹高速缓存方案。 可以通过使用与客户端相关的指纹预先填充客户端缓存来提高客户端缓存命中率。 相关指纹包括在最近一段时间内使用的指纹(例如,最后一个完整备份图像中包含的片段的指纹以及在最后一个完整备份图像之后为客户端创建的任何后续增量备份映像),因此被称为指纹 具有良好的时间局部性。 相关指纹还包括与具有良好空间局部性的存储容器相关联的指纹,因此被称为具有良好空间局部性的指纹。 为客户端缓存建立的预设阈值(例如,阈值Tc)用于确定存储容器(以及因此与存储容器相关联的指纹)是否具有良好的空间局部性。

    Handling data segments in deduplication
    7.
    发明授权
    Handling data segments in deduplication 有权
    处理重复数据删除中的数据段

    公开(公告)号:US09442807B1

    公开(公告)日:2016-09-13

    申请号:US13935405

    申请日:2013-07-03

    CPC classification number: G06F17/30159

    Abstract: In some embodiments, a method of maintaining a reference list for data deduplication is provided. The method includes discarding a newly arriving data segment in response to finding a fingerprint of the newly arriving data segment matches an existing fingerprint in a plurality of fingerprints on a fingerprint-to-file reference list. The method includes adding, in the fingerprint-to-file reference list, to a list for the existing fingerprint, a source for the newly arriving data segment, in response to the fingerprint-to-file reference list indicating the existing fingerprint does not correspond to a hot data segment and setting an indication in the fingerprint-to-file reference list that the existing fingerprint corresponds to the hot data segment in response to the list for the existing fingerprint meeting or exceeding a predetermined number of entries. Other embodiments are included.

    Abstract translation: 在一些实施例中,提供了维护用于重复数据删除的参考列表的方法。 该方法包括:响应于找到新到达的数据段的指纹,匹配指纹到文件参考列表上的多个指纹中的现有指纹,丢弃新到达的数据段。 该方法包括在指纹到文件参考列表中添加现有指纹的列表,响应于指示现有指纹的指纹到文件参考列表,新到达的数据段的源不对应 到达热数据段并且在指纹到文件参考列表中设置现有指纹对应于热数据段的指示,以响应于现有指纹会议的列表或超过预定数量的条目。 包括其他实施例。

    Systems and methods for managing deduplication reference data
    8.
    发明授权
    Systems and methods for managing deduplication reference data 有权
    用于管理重复数据删除引用数据的系统和方法

    公开(公告)号:US08954398B1

    公开(公告)日:2015-02-10

    申请号:US13734901

    申请日:2013-01-04

    CPC classification number: G06F17/30156

    Abstract: A method for managing deduplication reference data may include (1) identifying multiple of data containers configured to store a plurality of deduplicated data segments that are referenced by multiple data objects within a deduplicated data system, (2) maintaining multiple reference databases including (i) a first reference database corresponding to a first subset of the data containers and (ii) a second reference database corresponding to a second subset of the data containers, the second subset differing from the first subset, (3) determining that a data object references at least one segment within a first data container within the first subset but does not reference any data segment within a second data container within the second subset and (4) updating the first reference database with information specifying that the data object references at least one data segment within at least one data container within the first subset of data containers.

    Abstract translation: 用于管理重复数据删除参考数据的方法可以包括(1)识别被配置为存储由重复数据删除的数据系统内的多个数据对象引用的多个重复数据删除的数据段的多个数据容器,(2)维护多个参考数据库,包括(i) 对应于数据容器的第一子集的第一参考数据库和(ii)对应于数据容器的第二子集的第二参考数据库,第二子集与第一子集不同,(3)确定数据对象在 在第一子集内的第一数据容器内的至少一个段,但不引用第二子集内的第二数据容器内的任何数据段,以及(4)用指定数据对象引用至少一个数据段的信息更新第一参考数据库 在数据容器的第一子集内的至少一个数据容器内。

Patent Agency Ranking