IMPROVED FINGERPRINTS DATASTORE AND STALE FINGERPRINT REMOVAL IN DE-DUPLICATION ENVIRONMENTS
    1.
    发明申请
    IMPROVED FINGERPRINTS DATASTORE AND STALE FINGERPRINT REMOVAL IN DE-DUPLICATION ENVIRONMENTS 审中-公开
    改进指纹环境中的改进指纹数据和标记指纹去除

    公开(公告)号:WO2012083085A1

    公开(公告)日:2012-06-21

    申请号:PCT/US2011/065292

    申请日:2011-12-15

    Abstract: A storage server is coupled to a storage device that stores blocks of data, and generates a fingerprint for each data block stored on the storage device. The storage server creates a fingerprints datastore that is divided into a primary datastore and a secondary datastore. The primary datastore comprises a single entry for each unique fingerprint and the secondary datastore comprises an entry having an identical fingerprint as an entry in the primary datastore. The storage server merges entries in a changelog with the entries in the primary datastore to identify duplicate data blocks in the storage device and frees the identified duplicate data blocks in the storage device. The storage server stores the entries that correspond to the freed data blocks to a third datastore and overwrites the primary datastore with the entries from the merged data that correspond to the unique fingerprints to create an updated primary datastore.

    Abstract translation: 存储服务器耦合到存储数据块的存储设备,并且为存储在存储设备上的每个数据块生成指纹。 存储服务器创建一个指纹数据存储区,分为主数据存储和辅助数据存储。 主数据存储区包括每个唯一指纹的单个条目,辅助数据存储区包括具有与主数据存储区中的条目相同的指纹的条目。 存储服务器将更改日志中的条目与主数据存储中的条目合并,以识别存储设备中的重复数据块,并释放存储设备中标识的重复数据块。 存储服务器将与释放的数据块对应的条目存储到第三个数据存储,并使用与唯一指纹对应的合并数据中的条目覆盖主数据存储,以创建更新的主数据存储。

    FRAGMENTATION CONTROL FOR PERFORMING DEDUPLICATION OPERATIONS
    2.
    发明申请
    FRAGMENTATION CONTROL FOR PERFORMING DEDUPLICATION OPERATIONS 审中-公开
    执行重复操作的分段控制

    公开(公告)号:WO2013130410A1

    公开(公告)日:2013-09-06

    申请号:PCT/US2013/027673

    申请日:2013-02-25

    Applicant: NETAPP, INC.

    CPC classification number: G06F17/30156 G06F17/30336 G06F17/30864

    Abstract: The techniques introduced here provide for enabling deduplication operations for a file system without significantly affecting read performance of the file system due to fragmentation of the data sets in the file system. The techniques include determining, by a storage server that hosts the file system, a level of fragmentation that would be introduced to a data set stored in the file system as a result of performing a deduplication operation on the data set. The storage server then compares the level of fragmentation with a threshold value and determines whether to perform the deduplication operation based on a result of comparing the level of fragmentation with the threshold value. The threshold value represents an acceptable level of fragmentation in the data sets of the file system.

    Abstract translation: 这里介绍的技术提供了对文件系统进行重复数据删除操作,而不会由于文件系统中的数据集的碎片而显着影响文件系统的读取性能。 这些技术包括由承载文件系统的存储服务器确定作为对该数据集执行重复数据删除操作的结果,该层级将被引入存储在文件系统中的数据集。 然后,存储服务器将分段级别与阈值进行比较,并且基于将分段级别与阈值进行比较的结果来确定是否执行重复数据消除操作。 阈值表示文件系统的数据集中可接受的碎片级别。

    INLINE DEDUPLICATION
    3.
    发明申请
    INLINE DEDUPLICATION 审中-公开
    在线重复使用

    公开(公告)号:WO2017189476A1

    公开(公告)日:2017-11-02

    申请号:PCT/US2017/029242

    申请日:2017-04-25

    Applicant: NETAPP, INC.

    Abstract: One or more techniques and/or computing devices are provided for inline deduplication. For example, a checksum hash table and/or a block number hash table may be maintained within memory (e.g., a storage controller may maintain the hash tables in-core). The checksum hash table may be utilized for inline deduplication to identify potential donor blocks that may comprise the same data as an incoming storage operation. Data within an in-core buffer cache is eligible as potential donor blocks so that inline deduplication may be performed using data from the in-core buffer cache, which may mitigate disk access to underlying storage for which the in- core buffer cache is used for caching. The block number hash table may be used for updating or removing entries from the hash tables, such as for blocks that are no longer eligible as potential donor blocks (e.g., deleted blocks, blocks evicted from the in-core buffer cache, etc.).

    Abstract translation: 为线内重复数据删除提供一个或多个技术和/或计算设备。 例如,可以在存储器内维护校验和散列表和/或块号散列表(例如,存储控制器可以将散列表维护在内核中)。 校验和哈希表可用于内联去重复以识别可能包括与进入的存储操作相同的数据的潜在施主块。 核内缓冲区高速缓存内的数据适合作为潜在的施主模块,以便可以使用来自核心内缓冲区高速缓存的数据执行内联重复数据消除,这可能会缓解对使用内核缓冲区高速缓存的底层存储的磁盘访问 缓存。 块号散列表可用于更新或从散列表中删除条目,例如用于不再适合作为潜在施主块的块(例如,删除块,从核内缓冲区高速缓存中逐出的块等)

    IMPROVED FINGERPRINTS DATASTORE AND STALE FINGERPRINT REMOVAL IN DE-DUPLICATION ENVIRONMENTS
    6.
    发明公开
    IMPROVED FINGERPRINTS DATASTORE AND STALE FINGERPRINT REMOVAL IN DE-DUPLICATION ENVIRONMENTS 审中-公开
    改进的指纹数据存储和ALTFINGERABDRÜCKEN作者ENTDUPLIZIERUNG ENVIRONMENTS拆除

    公开(公告)号:EP2652644A1

    公开(公告)日:2013-10-23

    申请号:EP11808463.1

    申请日:2011-12-15

    Applicant: NetApp, Inc.

    Abstract: A storage server is coupled to a storage device that stores blocks of data, and generates a fingerprint for each data block stored on the storage device. The storage server creates a fingerprints datastore that is divided into a primary datastore and a secondary datastore. The primary datastore comprises a single entry for each unique fingerprint and the secondary datastore comprises an entry having an identical fingerprint as an entry in the primary datastore. The storage server merges entries in a changelog with the entries in the primary datastore to identify duplicate data blocks in the storage device and frees the identified duplicate data blocks in the storage device. The storage server stores the entries that correspond to the freed data blocks to a third datastore and overwrites the primary datastore with the entries from the merged data that correspond to the unique fingerprints to create an updated primary datastore.

    FRAGMENTATION CONTROL FOR PERFORMING DEDUPLICATION OPERATIONS
    8.
    发明公开
    FRAGMENTATION CONTROL FOR PERFORMING DEDUPLICATION OPERATIONS 审中-公开
    分段控制于执行ENTDUPLIZIERUNGSOPERATIONEN

    公开(公告)号:EP2820545A1

    公开(公告)日:2015-01-07

    申请号:EP13755844.1

    申请日:2013-02-25

    Applicant: NetApp, Inc.

    CPC classification number: G06F16/1748 G06F16/2272 G06F16/951

    Abstract: The techniques introduced here provide for enabling deduplication operations for a file system without significantly affecting read performance of the file system due to fragmentation of the data sets in the file system. The techniques include determining, by a storage server that hosts the file system, a level of fragmentation that would be introduced to a data set stored in the file system as a result of performing a deduplication operation on the data set. The storage server then compares the level of fragmentation with a threshold value and determines whether to perform the deduplication operation based on a result of comparing the level of fragmentation with the threshold value. The threshold value represents an acceptable level of fragmentation in the data sets of the file system.

Patent Agency Ranking