Heterogenous memory accommodating multiple erasure codes

    公开(公告)号:US12066895B2

    公开(公告)日:2024-08-20

    申请号:US18175422

    申请日:2023-02-27

    Abstract: A method for proactively rebuilding user data in a plurality of storage nodes of a storage cluster is provided. The method includes distributing user data and metadata throughout the plurality of storage nodes such that the plurality of storage nodes can read the user data, using erasure coding, despite loss of two of the storage nodes. The method includes determining that one of the storage nodes is unreachable and determining to rebuild the user data for the one of the storage nodes that is unreachable. The method includes reading the user data across a remainder of the plurality of storage nodes, using the erasure coding and writing the user data across the remainder of the plurality of storage nodes, using the erasure coding. A plurality of storage nodes within a single chassis that can proactively rebuild the user data stored within the storage nodes is also provided.

    DELEGATION OF DATA OWNERSHIP
    3.
    发明申请

    公开(公告)号:US20230120685A1

    公开(公告)日:2023-04-20

    申请号:US18084339

    申请日:2022-12-19

    Abstract: A processor-based method for locating data and metadata closely together in a storage system is provided. The method includes writing a first range of a file and a first metadata relating to attributes of the file into at least one segment controlled by a first authority of the file. The method includes delegating, by the first authority, a second authority for a second range of the file, and writing the second range of the file and second metadata relating to the attributes of the file into at least one segment controlled by the second authority.

    FILE LOCATION FOR DISTRIBUTED DIRECTORY

    公开(公告)号:US20220107920A1

    公开(公告)日:2022-04-07

    申请号:US17552603

    申请日:2021-12-16

    Abstract: A method for distribution of directories in a storage system is provided. The method includes distributing information, regarding location in the storage system of a plurality of files in a directory, to a plurality of owners in a plurality of storage nodes of the storage system, wherein ownership of differing subsets of the plurality of files of the directory is distributed among differing owners in differing storage nodes. The method includes receiving a request for location information in the storage system of a file of the plurality of files in the directory and determining, based on a file name of the file and an identity of the directory, which of the plurality of owners has ownership of the file and the location information for the file.

    REDUCING DATA DISTRIBUTION INEFFICIENCIES

    公开(公告)号:US20210382618A1

    公开(公告)日:2021-12-09

    申请号:US17407813

    申请日:2021-08-20

    Abstract: Systems and methods of deduplication aware scalable content placement are described. A method may include receiving data to be stored on one or more nodes of a storage array and calculating a plurality of hashes corresponding to the data. The method further includes determining a first subset of the plurality of hashes, determining a second subset of the plurality of hashes of the first subset, and generating a node candidate placement list. The method may further include sending the first subset to one or more nodes represented on the node candidate placement list and receiving, from the nodes represented on the node candidate placement list, characteristics corresponding to the nodes represented on the candidate placement list. The method may further include identifying one of the one or more nodes represented on the candidate placement list in view of the characteristic and sending the data to the identified node.

    Reducing data distribution inefficiencies

    公开(公告)号:US11119656B2

    公开(公告)日:2021-09-14

    申请号:US16436482

    申请日:2019-06-10

    Abstract: Systems and methods of deduplication aware scalable content placement am described. A method may include receiving data to be stored on one or more nodes of a storage array and calculating a plurality of hashes corresponding to the data. The method further includes determining a first subset of the plurality of hashes, determining a second subset of the plurality of hashes of the first subset, and generating a node candidate placement list. The method may further include sending the first subset to one or more nodes represented on the node candidate placement list and receiving, from the nodes represented on the node candidate placement list, characteristics corresponding to the nodes represented on the candidate placement list. The method may further include identifying one of the one or more nodes represented on the candidate placement list m view of the characteristic and sending the data to the identified node.

    PERSISTENT FILE LOCKS IN A STORAGE SYSTEM

    公开(公告)号:US20210232545A1

    公开(公告)日:2021-07-29

    申请号:US17213823

    申请日:2021-03-26

    Abstract: A method for file locks in a distributed file system, performed by a storage system is provided. The method includes receiving, at a storage system having a distributed file system, a request for access of a file and locking the file, through one of a plurality of persistent file locks in the storage system. The method includes accessing the file, through the distributed file system, and unlocking the file, through the one of the plurality of persistent file locks.

    Data transformation caching in an artificial intelligence infrastructure

    公开(公告)号:US10275285B1

    公开(公告)日:2019-04-30

    申请号:US16046337

    申请日:2018-07-26

    Abstract: Data transformation caching in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: identifying, in dependence upon one or more machine learning models to be executed on the GPU servers, one or more transformations to apply to a dataset; generating, in dependence upon the one or more transformations, a transformed dataset; storing, within one or more of the storage systems, the transformed dataset; receiving a plurality of requests to transmit the transformed dataset to one or more of the GPU servers; and responsive to each request, transmitting, from the one or more storage systems to the one or more GPU servers without re-performing the one or more transformations on the dataset, the transformed dataset.

Patent Agency Ranking