Invention Grant
US08880931B2 Method, distributed system and computer program for failure recovery
有权
方法,分布式系统和计算机程序,用于故障恢复
- Patent Title: Method, distributed system and computer program for failure recovery
- Patent Title (中): 方法,分布式系统和计算机程序,用于故障恢复
-
Application No.: US13520514Application Date: 2010-12-24
-
Publication No.: US08880931B2Publication Date: 2014-11-04
- Inventor: Wei Sun
- Applicant: Wei Sun
- Applicant Address: JP Tokyo
- Assignee: NEC Corporation
- Current Assignee: NEC Corporation
- Current Assignee Address: JP Tokyo
- Agency: Sughrue Mion, PLLC
- Priority: JP2010-000268 20100104
- International Application: PCT/JP2010/007523 WO 20101224
- International Announcement: WO2011/080910 WO 20110707
- Main IPC: G06F11/00
- IPC: G06F11/00 ; G06F11/20 ; G06F11/14

Abstract:
A distributed system includes: nodes each having a memory, running distributed processes, and checkpointing to create checkpoint data for each process; a selection unit selecting spare nodes for future failure recovery for each process; an allocation unit allocating and transmitting the checkpoint data to the spare nodes to make the spare nodes store the checkpoint data before failure; and a recovery unit selecting one checkpoint data for recovery, activates the selected checkpoint data to run a process on the spare node, or partitions the existing stored checkpoint data, when any checkpoint data is not suitable for recovery, the partitions of the checkpoint data as a whole being integrated into a complete checkpoint data; and transmitting the partitions from the spare nodes to a new node, and reorganizing the partitions into complete data to be activated to run a process on the new node.
Public/Granted literature
- US20120303998A1 METHOD, DISTRIBUTED SYSTEM AND COMPUTER PROGRAM FOR FAILURE RECOVERY Public/Granted day:2012-11-29
Information query