Invention Grant
- Patent Title: Asynchronous checkpointing with message passing to burst buffers
- Patent Title (中): 消息传递到突发缓冲区的异步检查点
-
Application No.: US13931940Application Date: 2013-06-30
-
Publication No.: US09244869B1Publication Date: 2016-01-26
- Inventor: John M. Bent , Sorin Faibish
- Applicant: EMC Corporation
- Applicant Address: US MA Hopkinton
- Assignee: EMC Corporation
- Current Assignee: EMC Corporation
- Current Assignee Address: US MA Hopkinton
- Agency: Ryan, Mason & Lewis, LLP
- Main IPC: G06F11/00
- IPC: G06F11/00 ; G06F13/16 ; G06F11/14 ; G06F17/30

Abstract:
Improved techniques are provided for asynchronous checkpointing in parallel computing environments. A burst buffer appliance is configured to communicate with a plurality of compute nodes of a parallel computing system over a network and also to store message logs for a plurality of processes executing on the compute nodes, wherein the plurality of processes employ asynchronous checkpointing. The processes executing on the compute nodes can exchange messages and/or perform other compute operations during an asynchronous checkpointing operation. The burst buffer appliance can optionally store checkpoint data that results from the asynchronous checkpointing operations. The burst buffer appliance can optionally store the messages using a partitioned data store, such as Multidimensional Data Hashing Indexing Middleware.
Information query