Fault tolerance for a distributed computing system

Invention Grant

US09645811B2 Fault tolerance for a distributed computing system (In force)

Patent Title: Fault tolerance for a distributed computing system
Application No.: US14242655

Application Date: 2014-04-01
Publication No.: US09645811B2

Publication Date: 2017-05-09
Inventor: Devin Carlen, Joe Heck, Mike Szilagyi, Mark Guis, Ken Caruso, Yona Benjamin Mankin
Applicant: OC Acquisition LLC
Applicant Address: US CA Redwood Shores
Assignee: OC Acquisition LLC
Current Assignee: OC Acquisition LLC
Current Assignee Address: US CA Redwood Shores
Agency: Precision IP
Main IPC: G06F11/00
IPC: G06F11/00; G06F9/445; G06F11/07; H04L29/06; H04L29/08; G06F9/54; H04L29/14

Abstract:

In one embodiment, a method detects a failure of a container in a controller node, where the container includes a service that is being performed and is isolated from other services being performed in other containers on the controller node. The controller node terminates the container that includes the service and determines a known state for the service. The known state is known to be operational and does not include a cause of the failure; the service operated from the known state, and changes made during operation were saved separately from the known state. The controller node then restarts the service in a new container that replaces the terminated container, and the restarted service starts from the known state without using the changes.
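
The recovery sequence in the abstract (detect the failure, terminate the container, determine the known state, restart in a new container without the saved changes) can be illustrated with a minimal in-memory sketch. This is not the patented implementation: the names `Container`, `ControllerNode`, `start_service`, and `recover_failed`, the `healthy` flag used as the failure signal, and the dictionary-based state are all hypothetical and chosen only for illustration.

```python
import itertools
from dataclasses import dataclass, field
from types import MappingProxyType
from typing import Mapping

_ids = itertools.count(1)  # hypothetical source of container identifiers


@dataclass
class Container:
    """Hypothetical isolated container running exactly one service."""
    service: str
    known_state: Mapping[str, str]               # read-only baseline
    changes: dict = field(default_factory=dict)  # runtime writes, kept apart
    container_id: int = field(default_factory=lambda: next(_ids))
    healthy: bool = True                         # assumed failure signal

    def write(self, key: str, value: str) -> None:
        # Writes never touch the known state; they go to the change layer.
        self.changes[key] = value

    def read(self, key: str):
        # The change layer shadows the known state while the service runs.
        return self.changes.get(key, self.known_state.get(key))


class ControllerNode:
    """Hypothetical controller node supervising one container per service."""

    def __init__(self) -> None:
        self.known_states = {}
        self.containers = {}

    def start_service(self, service: str, known_state: dict) -> Container:
        # Freeze a copy of the known state so that no container can alter it.
        frozen = MappingProxyType(dict(known_state))
        self.known_states[service] = frozen
        self.containers[service] = Container(service, frozen)
        return self.containers[service]

    def recover_failed(self) -> list:
        """Replace every failed container with a fresh one."""
        restarted = []
        for service, container in list(self.containers.items()):
            if container.healthy:
                continue
            # Terminate: drop the container together with its change layer.
            del self.containers[service]
            # Restart from the known state only; the changes are not reused.
            self.containers[service] = Container(
                service, self.known_states[service]
            )
            restarted.append(service)
        return restarted
```

Because the known state is frozen and runtime writes live in a separate layer, discarding the failed container also discards whatever change may have caused the failure, while containers for other services on the same node are left untouched.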

Public/Granted literature

US20140298091A1 Fault Tolerance for a Distributed Computing System; Public/Granted day: 2014-10-02

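IPC classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models, see G06N)
G06F11/00	Error detection; error correction; monitoring (methods or arrangements for verifying the correctness of marking on a record carrier, see G06K5/00; methods or arrangements for information storage based on relative movement between record carrier and transducer, see G11B, e.g. G11B20/18; methods or arrangements in static stores, see G11C29/00)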