Invention Grant
US08880931B2 Method, distributed system and computer program for failure recovery (Active)

  • Patent Title: Method, distributed system and computer program for failure recovery
  • Application No.: US13520514
    Application Date: 2010-12-24
  • Publication No.: US08880931B2
    Publication Date: 2014-11-04
  • Inventor: Wei Sun
  • Applicant: Wei Sun
  • Applicant Address: JP Tokyo
  • Assignee: NEC Corporation
  • Current Assignee: NEC Corporation
  • Current Assignee Address: JP Tokyo
  • Agency: Sughrue Mion, PLLC
  • Priority: JP2010-000268 20100104
  • International Application: PCT/JP2010/007523 WO 20101224
  • International Announcement: WO2011/080910 WO 20110707
  • Main IPC: G06F11/00
  • IPC: G06F11/00 G06F11/20 G06F11/14
Method, distributed system and computer program for failure recovery
Abstract:
A distributed system includes: nodes, each having a memory, that run distributed processes and perform checkpointing to create checkpoint data for each process; a selection unit that selects spare nodes for future failure recovery for each process; an allocation unit that allocates and transmits the checkpoint data to the spare nodes so that the spare nodes store the checkpoint data before a failure occurs; and a recovery unit that either selects one checkpoint data for recovery and activates it to run a process on the spare node, or, when no checkpoint data is suitable for recovery, partitions the existing stored checkpoint data such that the partitions as a whole integrate into complete checkpoint data, transmits the partitions from the spare nodes to a new node, and reorganizes the partitions into complete data that is activated to run a process on the new node.
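
The two recovery paths described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `SpareNode`, `allocate`, `partition`, `recover`, the `suitable` flag, and the `"new-node"` label are all hypothetical, and the abstract does not say how suitability is decided or how partitions are sized. The sketch assumes every spare node holds a full copy of the checkpoint data and that partitions are contiguous, roughly equal slices.

```python
from dataclasses import dataclass, field


@dataclass
class SpareNode:
    name: str
    suitable: bool  # hypothetical flag: can this node host the recovered process?
    checkpoints: dict = field(default_factory=dict)  # process id -> checkpoint bytes


def allocate(process_id, data, spares):
    """Store the checkpoint data on every selected spare node before any failure."""
    for node in spares:
        node.checkpoints[process_id] = data


def partition(data, index, count):
    """Return the index-th of `count` contiguous slices of `data` (assumed scheme)."""
    size = -(-len(data) // count)  # ceiling division
    return data[index * size:(index + 1) * size]


def recover(process_id, spares):
    """Return (host name, checkpoint data) for restarting the failed process."""
    holders = [n for n in spares if process_id in n.checkpoints]
    if not holders:
        raise LookupError(f"no checkpoint stored for {process_id}")
    # Path 1: a spare node is suitable, so its stored copy is activated in place.
    for node in holders:
        if node.suitable:
            return node.name, node.checkpoints[process_id]
    # Path 2: no copy is suitable. Each spare node contributes one partition,
    # and the new node reorganizes the partitions into the complete data.
    parts = [
        partition(node.checkpoints[process_id], i, len(holders))
        for i, node in enumerate(holders)
    ]
    return "new-node", b"".join(parts)
```

In the second path each spare node sends only its own slice, so the transfer to the new node is spread across the spare nodes rather than drawn from a single one.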