SYSTEM AND METHOD FOR SAMPLING BASED ELIMINATION OF DUPLICATE DATA
    1.
    发明申请
    SYSTEM AND METHOD FOR SAMPLING BASED ELIMINATION OF DUPLICATE DATA 审中-公开
    用于抽样基于消除重复数据的系统和方法

    公开(公告)号:WO2007127360A2

    公开(公告)日:2007-11-08

    申请号:PCT/US2007/010222

    申请日:2007-04-26

    CPC classification number: H03M7/00 H04N19/20 H04N19/23 H04N19/25

    Abstract: A technique for eliminating duplicate data is provided. Upon receipt of a new data set, one or more anchor points are identified within the data set. A bit-by-bit data comparison is then performed of the region surrounding the anchor point in the received data set with the region surrounding an anchor point stored within a pattern database to identify forward/backward delta values. The duplicate data identified by the anchor point, forward and backward delta values is then replaced in the received data set with a storage indicator.

    Abstract translation: 提供了一种消除重复数据的技术。 在接收到新的数据集之后,在数据集内识别一个或多个锚点。 然后,在包含存储在模式数据库内的锚定点周围的区域的接收数据集中围绕定位点的区域执行逐位数据比较,以识别前向/后向增量值。 由锚点,前向和后向增量值标识的重复数据随后用存储指示器在接收的数据集中替换。

    SYSTEM AND METHOD FOR ACCELERATING ANCHOR POINT DETECTION
    2.
    发明申请
    SYSTEM AND METHOD FOR ACCELERATING ANCHOR POINT DETECTION 审中-公开
    用于加速锚点检测的系统和方法

    公开(公告)号:WO2008153821A1

    公开(公告)日:2008-12-18

    申请号:PCT/US2008/006805

    申请日:2008-05-29

    CPC classification number: G06F17/30156 H03M7/3084

    Abstract: A sampling based technique for eliminating duplicate data (de-duplication) stored on storage resources, is provided. According to the invention, when a new data set, e.g., a backup data stream, is received by a server, e.g., a storage system or virtual tape library (VTL) system implementing the invention, one or more anchors are identified within the new data set. The anchors are identified using a novel anchor detection circuitry in accordance with an illustrative embodiment of the present invention. Upon receipt of the new data set by, for example, a network adapter of a VTL system, the data set is transferred using direct memory access (DMA) operations to a memory associated with an anchor detection hardware card that is operatively interconnected with the storage system. The anchor detection hardware card may be implemented as, for example, a FPGA is to quickly identify anchors within the data set. As the anchor detection process is performed using a hardware assist, the load on a main processor of the system is reduced, thereby enabling line speed de-duplication.

    Abstract translation: 提供了用于消除存储在存储资源上的重复数据(重复数据删除)的基于抽样的技术。 根据本发明,当服务器(例如,实施本发明的存储系统或虚拟磁带库(VTL))系统接收到诸如备份数据流的新数据集时,在新的数据集中识别出一个或多个锚点 数据集。 根据本发明的说明性实施例,使用新颖的锚定检测电路来识别锚。 在通过例如VTL系统的网络适配器接收到新数据集时,使用直接存储器访问(DMA)操作将数据集传送到与锚定检测硬件卡相关联的存储器,该存储器与存储器可操作地互连 系统。 锚定检测硬件卡可以被实现为例如FPGA快速识别数据集内的锚点。 由于使用硬件辅助进行锚定检测处理,系统的主处理器上的负载减少,从而实现线速度重复数据删除。

    METHOD AND APPARATUS TO STORE DATA PATTERNS
    3.
    发明申请
    METHOD AND APPARATUS TO STORE DATA PATTERNS 审中-公开
    存储数据模式的方法和装置

    公开(公告)号:WO2008094433A2

    公开(公告)日:2008-08-07

    申请号:PCT/US2008/000900

    申请日:2008-01-24

    Inventor: STAGER, Roger

    Abstract: A method and an apparatus to store data patterns are presented. In one embodiment, the method includes searching a pattern repository to find prior copies of a pattern and to reference one of the prior copies, or insert a new copy, based on the access time of the prior copy and the effect on the sequential stream performance.

    Abstract translation: 提出了存储数据模式的方法和装置。 在一个实施例中,该方法包括搜索模式存储库以基于先前副本的访问时间以及对顺序流性能的影响来查找模式的先前副本并且引用先前副本之一或插入新副本 。

    SYSTEM AND METHOD FOR BANDWIDTH OPTIMIZATION IN A NETWORK STORAGE ENVIRONMENT
    4.
    发明公开
    SYSTEM AND METHOD FOR BANDWIDTH OPTIMIZATION IN A NETWORK STORAGE ENVIRONMENT 审中-公开
    系统和方法用于在网络存储区域带宽优化

    公开(公告)号:EP2143023A1

    公开(公告)日:2010-01-13

    申请号:EP08726988.2

    申请日:2008-03-19

    CPC classification number: G06F17/30067

    Abstract: According to one or more embodiments of the present invention, a network cache intercepts data requested by a client from a remote server interconnected with the cache through one or more wide area network (WAN) links (e.g., for Wide Area File Services, or 'WAFS'). The network cache stores the data and sends the data to the client. The cache may then intercept a first write request for the data from the client to the remote server, and determine one or more portions of the data in the write request that changed from the data stored at the cache (e.g., according to one or more hashes created based on the data). The network cache then sends a second write request for only the changed portions of the data to the remote server.

Patent Agency Ranking