System and method for balancing compression and read performance in a storage system

    Publication No.: US10216754B1

    Publication date: 2019-02-26

    Application No.: US14038632

    Filing date: 2013-09-26

    Abstract: Techniques for balancing data compression and read performance of data chunks of a storage system are described herein. According to one embodiment, similar data chunks are identified based on sketches of a plurality of data chunks stored in the storage system. A first portion of the similar data chunks as a first group is associated with a first storage area. The first storage area is associated with one or more data chunks that are dissimilar to the first group but are likely accessed together. The first group of the similar data chunks and its associated dissimilar data chunks are compressed and stored in the first storage area.

    System and method for improving data compression of a storage system in an online manner

    Publication No.: US09767154B1

    Publication date: 2017-09-19

    Application No.: US15336769

    Filing date: 2016-10-27

    CPC classification number: G06F17/30501 H03M7/3091

    Abstract: Techniques for improving data compression of a storage system in an online manner are described herein. According to one embodiment, in response to a sequence of data to be stored, the sequence of data is partitioned into a plurality of data chunks according to a predetermined chunking algorithm. A sketch for each of the data chunks is generated based on one or more features extracted from the data chunk. Each of the data chunks of the sequence of data is associated with one of a plurality of groups based on the sketch, wherein each group is represented by a sketch. The data chunks of each group are compressed and stored in a compression region of the storage systems, such that similar data chunks are compressed and stored in the same compression region.
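
The pipeline in this abstract (chunk, sketch, group by sketch, compress each group into one region) can be illustrated with a minimal sketch. The abstract does not name the chunking algorithm, the features, or the compressor, so everything concrete here is an assumption: fixed-size chunking, a max-hash over byte shingles as the feature, exact sketch equality as the grouping rule, and `zlib` as the compressor. The names `chunk`, `sketch`, and `group_and_compress` are hypothetical.

```python
import hashlib
import zlib
from collections import defaultdict

CHUNK_SIZE = 64    # assumption: fixed-size chunks stand in for the chunking algorithm
WINDOW = 8         # assumption: shingle width used for feature extraction
NUM_FEATURES = 4   # assumption: number of features combined into one sketch


def chunk(data: bytes, size: int = CHUNK_SIZE) -> list:
    """Partition the incoming data sequence into fixed-size chunks."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def sketch(chunk_bytes: bytes) -> tuple:
    """Build a sketch: for each salted hash, keep the maximum value over all shingles."""
    features = []
    for salt in range(NUM_FEATURES):
        best = 0
        for i in range(max(1, len(chunk_bytes) - WINDOW + 1)):
            digest = hashlib.blake2b(
                chunk_bytes[i:i + WINDOW], digest_size=8, salt=bytes([salt])
            ).digest()
            best = max(best, int.from_bytes(digest, "big"))
        features.append(best)
    return tuple(features)


def group_and_compress(data: bytes) -> dict:
    """Group chunks by sketch, then compress each group into its own region."""
    groups = defaultdict(list)
    for c in chunk(data):
        groups[sketch(c)].append(c)
    # One compressed region per group, keyed by the sketch that represents the group.
    return {s: zlib.compress(b"".join(cs)) for s, cs in groups.items()}
```

Because chunks with equal sketches land in the same region, the compressor sees their shared content together. A real system would likely tolerate partial sketch matches rather than require exact equality.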

    System and method for improving data compression of a storage system in an online manner
    Granted invention patent (in force)

    Publication No.: US09514146B1

    Publication date: 2016-12-06

    Application No.: US14038635

    Filing date: 2013-09-26

    CPC classification number: G06F17/30501 H03M7/3091

    Abstract: Techniques for improving data compression of a storage system in an online manner are described herein. According to one embodiment, in response to a sequence of data to be stored, the sequence of data is partitioned into a plurality of data chunks according to a predetermined chunking algorithm. A sketch for each of the data chunks is generated based on one or more features extracted from the data chunk. Each of the data chunks of the sequence of data is associated with one of a plurality of groups based on the sketch, wherein each group is represented by a sketch. The data chunks of each group are compressed and stored in a compression region of the storage systems, such that similar data chunks are compressed and stored in the same compression region.

    File indexing using an exclusion list of a deduplicated cache system of a storage system
    Granted invention patent (in force)

    Publication No.: US09189414B1

    Publication date: 2015-11-17

    Application No.: US14038687

    Filing date: 2013-09-26

    CPC classification number: G06F12/08 G06F12/0891 G06F12/12 G06F12/121 G06F13/12

    Abstract: A computer-implemented method for indexing content stored in a cache memory device is disclosed. The method starts with maintaining a file index having a plurality of extent entries, each extent entry corresponding to one of a plurality of file extents stored in a cache memory device that caches data stored in a persistent storage device of a storage system. In response to receiving a request to read a first file region of a first file, the method continues with retrieving a first data block from the persistent storage device that contains the first data block; caching the first data block at a first storage location of the cache memory device; and creating a first extent entry in the file index having at least a first node, where the first node includes an address of the first storage location and a first bitmap indicating which data blocks are valid.
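
The extent entry described in this abstract (a node holding a cache address plus a validity bitmap) can be illustrated roughly as follows. This is a sketch under stated assumptions, not the patented design: the extent size, the dictionary-backed persistent store, the list-backed cache, and the names `Node`, `FileIndex`, and `read_block` are all hypothetical.

```python
from dataclasses import dataclass, field

BLOCKS_PER_EXTENT = 8  # assumption: number of data blocks covered by one extent


@dataclass
class Node:
    cache_address: int      # first cache slot reserved for this extent
    valid_bitmap: int = 0   # bit i set means block i of the extent is valid in cache

    def mark_valid(self, block: int) -> None:
        self.valid_bitmap |= 1 << block

    def is_valid(self, block: int) -> bool:
        return bool((self.valid_bitmap >> block) & 1)


@dataclass
class FileIndex:
    # Maps (file_id, extent_no) to the node of that extent entry.
    extents: dict = field(default_factory=dict)

    def read_block(self, file_id, block_no, persistent, cache):
        extent_no, offset = divmod(block_no, BLOCKS_PER_EXTENT)
        node = self.extents.get((file_id, extent_no))
        if node is not None and node.is_valid(offset):
            return cache[node.cache_address + offset]  # cache hit
        # Cache miss: retrieve the block from persistent storage.
        data = persistent[(file_id, block_no)]
        if node is None:
            # Create the extent entry and reserve cache slots for it.
            node = Node(cache_address=len(cache))
            cache.extend([None] * BLOCKS_PER_EXTENT)
            self.extents[(file_id, extent_no)] = node
        cache[node.cache_address + offset] = data
        node.mark_valid(offset)
        return data
```

The bitmap lets one extent entry cover a partially cached extent: only blocks that have actually been read are marked valid, and other reads fall through to persistent storage.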

    Method and system for processing large scale emails and limiting resource consumption and interruption therefrom
    Granted invention patent (in force)

    Publication No.: US09559999B1

    Publication date: 2017-01-31

    Application No.: US14292714

    Filing date: 2014-05-30

    CPC classification number: H04L51/16 G06Q10/107 H04L51/12

    Abstract: Techniques for reducing and discouraging sending large scale emails are described herein. According to one embodiment, in response to a first email received from a sender to be sent to a list of recipients, a distribution cost of the first email is determined based on content of the first email and the recipients. An email client application is to present a first graphical user interface (GUI) page to the sender prompting a confirmation from the sender, where the first GUI page includes information indicating a size of the first email and a number of recipients, if the distribution cost of the first email is above a first predetermined threshold. In response to a positive confirmation from the sender, the first email is sent to the intended recipient.
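
The confirmation flow in this abstract can be illustrated with a short sketch. The abstract only says the distribution cost depends on the email's content and its recipients; the formula used here (body size times recipient count), the threshold value, and the names `Email`, `distribution_cost`, and `send` are assumptions made for illustration.

```python
from dataclasses import dataclass

COST_THRESHOLD = 50_000_000  # assumption: total bytes delivered before a prompt is shown


@dataclass
class Email:
    body: bytes
    recipients: list


def distribution_cost(email: Email) -> int:
    """Assumed cost model: bytes of content multiplied by number of recipients."""
    return len(email.body) * len(email.recipients)


def send(email: Email, confirm, deliver, threshold: int = COST_THRESHOLD) -> bool:
    """Deliver the email, asking the sender to confirm first if the cost is too high."""
    if distribution_cost(email) > threshold:
        prompt = (
            f"This email is {len(email.body)} bytes and goes to "
            f"{len(email.recipients)} recipients. Send anyway?"
        )
        if not confirm(prompt):
            return False  # sender declined, nothing is sent
    deliver(email)
    return True
```

Here `confirm` stands in for the GUI page that shows the size and recipient count, and `deliver` stands in for the actual mail transport.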

    Method and system for predicting redundant array of independent disks (RAID) vulnerability
    Granted invention patent (in force)

    Publication No.: US09535779B1

    Publication date: 2017-01-03

    Application No.: US14341669

    Filing date: 2014-07-25

    CPC classification number: G06F11/076 G06F11/00 G06F11/0727 G06F11/3034

    Abstract: Techniques for determining vulnerability of disks are described herein. According to one embodiment, for each of a plurality of disks representing a redundant array of independent disks (RAID), a reallocated sector count associated with the disk is obtained, the reallocated sector count representing a number of sectors that have been reallocated due to an error of a storage transaction to the disk. A failure probability of the disk given the obtained reallocated sector count is determined using a predictive model, wherein the predictive model was generated based on historical operating data of a set of known disks. Thereafter, a failure probability of at least two of the disks in the RAID is determined based on the failure probability of each of the disks to determine vulnerability of the RAID.
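
The two steps in this abstract (per-disk failure probability from the reallocated sector count, then the probability that at least two disks fail) can be worked through in a small sketch. The lookup table below is an invented placeholder for the predictive model, and the combination step assumes disk failures are independent, which the abstract does not state. The function names are hypothetical.

```python
import math

# Hypothetical model: (minimum reallocated sector count, failure probability).
# A real model would be fitted to historical operating data of known disks.
MODEL = [(0, 0.01), (10, 0.05), (100, 0.30), (1000, 0.80)]


def disk_failure_probability(reallocated_sectors: int) -> float:
    """Return the probability of the highest bucket the count reaches."""
    prob = MODEL[0][1]
    for min_count, p in MODEL:
        if reallocated_sectors >= min_count:
            prob = p
    return prob


def prob_at_least_two_fail(probs: list) -> float:
    """P(at least two failures) = 1 - P(no failure) - P(exactly one failure)."""
    p_none = math.prod(1 - p for p in probs)
    p_one = sum(
        p * math.prod(1 - q for j, q in enumerate(probs) if j != i)
        for i, p in enumerate(probs)
    )
    return 1 - p_none - p_one
```

For a group that tolerates a single disk failure, the value returned by `prob_at_least_two_fail` is the quantity of interest, since a second concurrent failure is what causes data loss.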

    Indexing a deduplicated cache system by integrating fingerprints of underlying deduplicated storage system
    Granted invention patent (in force)

    Publication No.: US09336143B1

    Publication date: 2016-05-10

    Application No.: US14038694

    Filing date: 2013-09-26

    Abstract: A computer-implemented method for indexing content stored in a cache memory device is disclosed. The method starts with maintaining a file index having a plurality of extent entries, each extent entry corresponding to one of a plurality of file extents stored in a cache memory device that caches data stored in a persistent storage device of a storage system. The method continues with maintaining a fingerprint index having a plurality of fingerprint entries, each mapping a fingerprint to a data region of a file indexed in the file index, wherein each fingerprint indexed in the fingerprint index is retrieved from metadata stored in the persistent storage device of the storage system when one or more corresponding data chunks were accessed, and deduplicating and accessing the file extents stored in the cache memory device using the file index and the fingerprint index.
