Abstract:
PROBLEM TO BE SOLVED: To integrate a file system with a RAID array so as to export precise information about the arrangement of data blocks in the RAID subsystem. SOLUTION: A separate CWL pointer is used for each disk 1022 in the disk array 1030. The algorithm aims to improve RAID efficiency and read performance. To this end, it always writes to the disk 1022 with the lowest CWL pointer. In addition, a different disk is chosen only when allocation of space for a new file begins or when N blocks on the same disk 1022 have been allocated for a single file. As a result, CWL pointers are never more than N blocks apart, even on different disks 1024, and a large file can still be allocated in N consecutive blocks on the same disk 1022. COPYRIGHT: (C)2006,JPO&NCIPI
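The allocation rule described above can be illustrated with a minimal sketch. The class name `CwlAllocator`, its method names, and the in-memory representation are hypothetical and are not taken from the abstract; the sketch only assumes one CWL pointer per disk and a limit of N consecutive blocks per file per disk.

```python
class CwlAllocator:
    """Toy model: one CWL pointer per disk, at most n blocks per file per disk run."""

    def __init__(self, num_disks, n):
        self.n = n                      # N: max consecutive blocks for one file on one disk
        self.cwl = [0] * num_disks      # CWL pointer of each disk
        self._disk = None               # disk currently being written
        self._file = None               # file currently being allocated
        self._run = 0                   # blocks written to _disk for _file so far

    def allocate(self, file_id):
        """Return (disk, block) for the next block of file_id."""
        new_file = file_id != self._file
        if self._disk is None or new_file or self._run >= self.n:
            # Switch disks only for a new file or after N blocks:
            # pick the disk with the lowest CWL pointer.
            self._disk = min(range(len(self.cwl)), key=lambda d: self.cwl[d])
            self._file = file_id
            self._run = 0
        block = self.cwl[self._disk]
        self.cwl[self._disk] += 1
        self._run += 1
        return self._disk, block


alloc = CwlAllocator(num_disks=3, n=4)
placements = [alloc.allocate("a") for _ in range(10)]
print(placements)
print(alloc.cwl)
```

Because a disk is only ever selected when its pointer is the lowest, and at most N blocks are written before the next selection, the pointers in this model stay within N blocks of one another.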
Abstract:
PROBLEM TO BE SOLVED: To enhance the versatility and utility of a snapshot replication mechanism. SOLUTION: A system and method perform remote asynchronous replication (mirroring) of changes in source file system snapshots to a replicated file system at a destination. A scanner scans the blocks constituting two versions of a snapshot of the source file system and identifies changed blocks by differences in the block volume numbers found while scanning the logical file block index of each snapshot. The tree of blocks associated with a file is traversed downward, bypassing pointers that are unchanged between versions, to identify changes in the tree hierarchy, and the changes are transmitted to the destination mirror, i.e., the replicated snapshot. At the destination, the destination snapshot is updated using the source changes. Any deleted or changed inode already existing on the destination is moved to a temporary ("momentary") directory and, if it is reused, is re-linked into the reconstructed replicated snapshot directory. COPYRIGHT: (C)2004,JPO
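The tree walk described above can be sketched as follows. This is an illustration only: the node layout `(block_number, children)`, the function name `diff_tree`, and the assumption that an unchanged block number implies an unchanged subtree (as in a copy-on-write layout) are not stated in the abstract. Blocks that exist only in the old version are not reported by this sketch.

```python
def diff_tree(old, new, path=()):
    """Yield (path, new_block_number) for each data block that differs.

    A node is (block_number, children); children is None for a data block
    and a tuple of child nodes for an indirect block.
    """
    if old is not None and old[0] == new[0]:
        return  # same block number in both versions: bypass this pointer
    children = new[1]
    if children is None:
        yield path, new[0]  # changed or newly added data block
        return
    old_children = old[1] if old is not None and old[1] is not None else ()
    for i, child in enumerate(children):
        old_child = old_children[i] if i < len(old_children) else None
        yield from diff_tree(old_child, child, path + (i,))


# Two versions of one file's block tree; only the last data block changed.
old = (100, ((10, ((1, None), (2, None))), (11, ((3, None), (4, None)))))
new = (101, ((10, ((1, None), (2, None))), (12, ((3, None), (5, None)))))
print(list(diff_tree(old, new)))
```

Only the subtrees whose block numbers differ are descended, so the cost of the scan in this model tracks the amount of change rather than the size of the file.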
Abstract:
A system and method employ one or more clients of a virtualized compute cluster as an execution engine for a portion of a storage operating system implemented as a virtual machine on a storage system node of a storage system cluster. If there is processing bandwidth of a client that is not fully utilized and the load on the storage system node is high, the portion of the storage operating system is ported to the client of the compute cluster in a manner that externally distributes the storage architecture from the storage system cluster. Advantageously, the processing performance of the storage system cluster is improved by, among other things, offloading some of the network processing load from the storage system node.
Abstract:
Methods of iterating through a set of data objects on a source server, copying them to a destination server, and preparing a mapping database correlating source and destination data object identifiers are described and claimed. The mapping database also includes data retention policy information and policy discrepancy information. Systems using similar methods, and software to perform similar methods, are also described and claimed.
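A minimal sketch of such a mapping step is shown below. The record fields (`id`, `retention_days`), the function name `migrate`, and the callback `copy_to_destination` are hypothetical; the abstract does not specify how objects, identifiers, or retention policies are represented.

```python
def migrate(source_objects, copy_to_destination):
    """Copy each object and build mapping records.

    copy_to_destination(obj) is assumed to return
    (destination_id, retention_days_applied_at_destination).
    """
    mapping = []
    for obj in source_objects:
        dest_id, dest_retention = copy_to_destination(obj)
        mapping.append({
            "source_id": obj["id"],
            "dest_id": dest_id,
            "retention_days": obj["retention_days"],
            # Flag objects whose destination policy differs from the source policy.
            "policy_discrepancy": dest_retention != obj["retention_days"],
        })
    return mapping


def fake_copy(obj):
    # Stand-in destination that caps retention at 365 days.
    return "dst-" + obj["id"], min(obj["retention_days"], 365)


records = migrate(
    [{"id": "a", "retention_days": 30}, {"id": "b", "retention_days": 730}],
    fake_copy,
)
print(records)
```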
Abstract:
An improved system and method enhances performance of updates to sequential block storage of a storage system. A disk-based sort procedure is provided to establish locality among updates (write data) held in a disk-based log, thereby enabling the write data to be efficiently written to home locations on a home location array. As the write data is received, a log manager of the storage system temporarily stores the data efficiently on the disk-based log. As more write data arrives, the log manager sorts the data in the log in accordance with the sort procedure, thus increasing the locality of data when stored on the home location array. When the log approaches capacity, the log manager writes the sorted data to their home locations on the array with high locality and performance.
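The log-then-sort behavior can be illustrated with a simplified in-memory sketch. The class `LogManager` below, its `capacity` parameter, and the use of a Python dict as the home location array are assumptions for illustration; the abstract describes a disk-based log and sort procedure, which this sketch does not reproduce.

```python
class LogManager:
    """Toy model: buffer writes in a log, then flush them in home-address order."""

    def __init__(self, capacity, home):
        self.capacity = capacity   # number of log entries that triggers a flush
        self.home = home           # dict standing in for the home location array
        self.log = []              # pending (home_address, data) entries
        self.flush_order = []      # addresses in the order they were written home

    def write(self, addr, data):
        self.log.append((addr, data))
        if len(self.log) >= self.capacity:
            self.flush()

    def flush(self):
        # sorted() is stable, so for repeated addresses the latest write lands last.
        for addr, data in sorted(self.log, key=lambda entry: entry[0]):
            self.home[addr] = data
            self.flush_order.append(addr)
        self.log.clear()


home = {}
manager = LogManager(capacity=4, home=home)
for addr, data in [(7, "a"), (2, "b"), (7, "c"), (1, "d")]:
    manager.write(addr, data)
print(manager.flush_order)
print(home)
```

Writes arrive in arbitrary order but reach the home locations in ascending address order, which is the locality the abstract attributes to the sort procedure.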
Abstract:
A sampling-based technique for eliminating duplicate data (de-duplication) stored on storage resources is provided. According to the invention, when a new data set, e.g., a backup data stream, is received by a server, e.g., a storage system or virtual tape library (VTL) system implementing the invention, one or more anchors are identified within the new data set. The anchors are identified using novel anchor detection circuitry in accordance with an illustrative embodiment of the present invention. Upon receipt of the new data set by, for example, a network adapter of a VTL system, the data set is transferred using direct memory access (DMA) operations to a memory associated with an anchor detection hardware card that is operatively interconnected with the storage system. The anchor detection hardware card may be implemented as, for example, an FPGA to quickly identify anchors within the data set. As the anchor detection process is performed using a hardware assist, the load on a main processor of the system is reduced, thereby enabling line speed de-duplication.
Abstract:
A method and an apparatus to store data patterns are presented. In one embodiment, the method includes searching a pattern repository to find prior copies of a pattern and to reference one of the prior copies, or insert a new copy, based on the access time of the prior copy and the effect on the sequential stream performance.
Abstract:
A cache is used in a network storage system that includes a plurality of data storage nodes in a storage cluster, to automatically spread read and write access load, by a plurality of storage clients, for file system data and metadata, across the plurality of data storage nodes.
Abstract:
A system and method for calculating and storing block fingerprints for data deduplication. A fingerprint extraction layer generates a fingerprint of a predefined size, e.g., 64 bits, for each data block stored by a storage system. Each fingerprint is stored in a fingerprint record, and the fingerprint records are, in turn, stored in a fingerprint database for access by the data deduplication module. The data deduplication module may periodically compare the fingerprints to identify duplicate fingerprints, which, in turn, indicate duplicate data blocks.
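A minimal sketch of the fingerprint-and-compare flow follows. The abstract does not name a fingerprint function; the use of a 64-bit BLAKE2b digest here, along with the function names `fingerprint` and `find_duplicates` and the dict-based database, is an assumption made for illustration.

```python
import hashlib
from collections import defaultdict


def fingerprint(block: bytes) -> int:
    """Return a 64-bit fingerprint of a data block (BLAKE2b is an assumed choice)."""
    digest = hashlib.blake2b(block, digest_size=8).digest()
    return int.from_bytes(digest, "big")


def find_duplicates(blocks: dict) -> list:
    """Group block ids by fingerprint; return groups with more than one member."""
    database = defaultdict(list)  # fingerprint -> list of block ids
    for block_id, data in blocks.items():
        database[fingerprint(data)].append(block_id)
    return [ids for ids in database.values() if len(ids) > 1]


blocks = {0: b"aaaa", 1: b"bbbb", 2: b"aaaa"}
print(find_duplicates(blocks))
```

Matching fingerprints only indicate candidate duplicates; a system built along these lines would typically confirm by comparing the block contents before sharing them.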
Abstract:
A system and method for examining the content of a client file system stored on an exported lun is provided. A client agent interfaces with a client file system to determine block layout information of data containers within the client file system. This block layout information is then transmitted to the storage system exporting the lun. The storage system utilizes the block layout information to generate a shadow volume utilizing the storage system's file system.