Invention Grant
- Patent Title: Identifying failure in a tree network of a parallel computer
- Application No.: US11531787
- Application Date: 2006-09-14
- Publication No.: US07783933B2
- Publication Date: 2010-08-24
- Inventor: Charles J. Archer, Kurt W. Pinnow, Brian P. Wallenfelt
- Applicant: Charles J. Archer, Kurt W. Pinnow, Brian P. Wallenfelt
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Biggers & Ohanian, LLP
- Main IPC: G06F11/00
- IPC: G06F11/00

Abstract:
Methods, parallel computers, and products are provided for identifying failure in a tree network of a parallel computer. The parallel computer includes one or more processing sets, each including an I/O node and a plurality of compute nodes. For each processing set, embodiments include selecting a set of test compute nodes, the test compute nodes being a subset of the compute nodes of the processing set; measuring the performance of the I/O node of the processing set; measuring the performance of the selected set of test compute nodes; calculating a current test value in dependence upon the measured performance of the I/O node of the processing set, the measured performance of the set of test compute nodes, and a predetermined value for I/O node performance; and comparing the current test value with a predetermined tree performance threshold. If the current test value is below the predetermined tree performance threshold, embodiments include selecting another set of test compute nodes. If the current test value is not below the predetermined tree performance threshold, embodiments include selecting one or more potential problem nodes from the test compute nodes and individually testing the potential problem nodes and the links to them.
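
The control flow described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation. The abstract does not give the formula for the current test value or the rule for choosing potential problem nodes, so `placeholder_test_value`, the "slowest node" selection rule, and every function and parameter name here are assumptions made for illustration.

```python
from statistics import mean
from typing import Callable, List, Sequence


def placeholder_test_value(io_perf: float,
                           node_perfs: Sequence[float],
                           expected_io_perf: float) -> float:
    """Hypothetical formula (NOT from the abstract): the gap between the
    I/O node's measured performance and the mean test-node performance,
    relative to the predetermined I/O node performance."""
    return abs(mean(node_perfs) - io_perf) / expected_io_perf


def scan_processing_set(compute_nodes: Sequence[str],
                        measure_io: Callable[[], float],
                        measure_node: Callable[[str], float],
                        expected_io_perf: float,
                        threshold: float,
                        subset_size: int,
                        test_value=placeholder_test_value) -> List[str]:
    """Walk one processing set subset by subset and return the potential
    problem nodes of the first subset whose test value is not below the
    threshold. Returns an empty list if every subset stays below it."""
    for start in range(0, len(compute_nodes), subset_size):
        # Select a set of test compute nodes (a subset of the processing set).
        test_nodes = compute_nodes[start:start + subset_size]

        # Measure the I/O node and the selected test compute nodes.
        io_perf = measure_io()
        perfs = {node: measure_node(node) for node in test_nodes}

        # Calculate the current test value and compare it with the threshold.
        value = test_value(io_perf, list(perfs.values()), expected_io_perf)
        if value < threshold:
            continue  # below threshold: select another set of test nodes

        # Not below threshold: pick potential problem nodes.
        # Assumption: the slowest node(s) in the subset are the candidates.
        worst = min(perfs.values())
        return [node for node, perf in perfs.items() if perf == worst]
    return []


# Usage with made-up measurements: node "n3" is much slower than its peers.
perf = {"n0": 100.0, "n1": 100.0, "n2": 100.0, "n3": 40.0}
suspects = scan_processing_set(list(perf), lambda: 100.0, perf.get,
                               expected_io_perf=100.0, threshold=0.2,
                               subset_size=2)
print(suspects)
```

The individual testing of each potential problem node and of the links to it, which the abstract names as the final step, is not modeled here. The returned list would be the input to that step.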
Public/Granted literature
- US20080072101A1: Identifying Failure in a Tree Network of a Parallel Computer (Public/Granted day: 2008-03-20)