Invention Grant
US07870234B2 Highly scalable and highly available cluster system management scheme
有权
高可扩展性和高可用性的集群系统管理方案
- Patent Title: Highly scalable and highly available cluster system management scheme
- Patent Title (中): 高可扩展性和高可用性的集群系统管理方案
-
Application No.: US12139062Application Date: 2008-06-13
-
Publication No.: US07870234B2Publication Date: 2011-01-11
- Inventor: James W. Arendt , Ching-Yun Chao , Rodolfo Ausgusto Mancisidor
- Applicant: James W. Arendt , Ching-Yun Chao , Rodolfo Ausgusto Mancisidor
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Dillon & Yudell LLP
- Main IPC: G06F15/16
- IPC: G06F15/16 ; G06F12/00

Abstract:
A cluster system is treated as a set of resource groups, each resource group including a highly available application and the resources upon which it depends. A resource group may have between 2 and M data processing systems, where M is small relative to the cluster size N of the total cluster. Configuration and status information for the resource group is fully replicated only on those data processing systems which are members of the resource group. A configuration object/database record for the resource group has an associated owner list identifying the data processing systems which are members of the resource group and which may therefore manage the application. A data processing system may belong to more than one resource group, however, and configuration and status information for the data processing system is replicated to each data processing system which could be affected by failure of the subject data processing system—that is, any data processing system which belongs to at least one resource group also containing the subject data processing system. The partial replication scheme of the present invention allows resource groups to run in parallel, reduces the cost of data replication and access, is highly scalable and applicable to very large clusters, and provides better performance after a catastrophe such as a network partition.
Public/Granted literature
- US20080320112A1 Highly scalable and highly available cluster system management scheme Public/Granted day:2008-12-25
Information query