Distributed memory checkpointing using storage class memory systems
Abstract:
Systems and methods are provided for implementing memory checkpointing using a distributed non-volatile memory system. For example, an application runs on a plurality of server nodes in a server cluster. Each server node includes system memory having volatile system memory and non-volatile system memory. A current application state of the application is maintained in the system memory of one or more server nodes. A checkpoint operation is performed to generate a distributed checkpoint image of the current application state of the application. The distributed checkpoint image is stored in the non-volatile system memory of the plurality of server nodes. Fault-tolerant parity data is generated for the distributed checkpoint image, and the fault-tolerant parity data for the distributed checkpoint image is stored in the non-volatile system memory of one or more of the plurality of server nodes.
Information query
Patent Agency Ranking
0/0