Abstract:
A method, system, and program code for synchronizing scheduler interrupts across multiple nodes of a cluster. Network timers and local scheduling timers are clocked off a system source clock. A processor in each computing node repeatedly reads a network time-of-day counter. The start of scheduler interrupts is synchronized to the moment the time-of-day counter reaches an integer multiple of a synchronization interval, which is itself an integer number of network timer ticks. The processor then sends an interprocessor scheduler interrupt to the other processors in the node, so that scheduling timers are synchronized within the computing node and throughout the cluster.
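The boundary test described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `next_sync_boundary`, `wait_for_sync_boundary`, `read_counter`, and `sync_interval` are hypothetical, and the counter is modeled as any callable that returns an integer tick count.

```python
from typing import Callable


def next_sync_boundary(now_ticks: int, sync_interval: int) -> int:
    """Return the smallest multiple of sync_interval that is >= now_ticks."""
    if sync_interval <= 0:
        raise ValueError("sync_interval must be a positive number of ticks")
    # Ceiling division using integer arithmetic only.
    return -(-now_ticks // sync_interval) * sync_interval


def wait_for_sync_boundary(read_counter: Callable[[], int], sync_interval: int) -> int:
    """Poll a time-of-day counter until it reaches the next sync boundary.

    read_counter is a hypothetical stand-in for reading the network
    time-of-day counter. The boundary tick is returned; a real system would
    start its scheduler interrupts at this point.
    """
    target = next_sync_boundary(read_counter(), sync_interval)
    while True:
        # Repeatedly read the counter, as the abstract describes.
        if read_counter() >= target:
            return target


if __name__ == "__main__":
    ticks = iter(range(95, 1000))  # simulated counter, one tick per read
    print(wait_for_sync_boundary(lambda: next(ticks), 100))
```

Because every node derives the same boundary from the same network counter, nodes that run this check independently would begin on the same tick, which is the property the abstract relies on.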
Abstract:
In accordance with one embodiment of the invention, a method of providing performance data for nodes in a high performance computing system includes receiving a request for performance data for a node in the high performance computing system. According to the method, a driver in kernel space causes the performance data for the node to be stored in kernel memory. The kernel memory is accessible from user space via a first system file.
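From the user-space side, reading such a system file might look like the following Python sketch. The abstract does not specify the file's path or layout, so the "name value" line format and the function names `parse_perf_data` and `read_node_perf_data` are assumptions made purely for illustration.

```python
from pathlib import Path
from typing import Dict


def parse_perf_data(text: str) -> Dict[str, int]:
    """Parse counters from text laid out as one 'name value' pair per line.

    The line format is an assumption; the abstract does not define it.
    """
    counters: Dict[str, int] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        name, value = line.split()
        counters[name] = int(value)
    return counters


def read_node_perf_data(system_file: str) -> Dict[str, int]:
    """Read and parse performance data exposed through a system file.

    system_file is a hypothetical path supplied by the caller.
    """
    return parse_perf_data(Path(system_file).read_text())
```

Keeping the parsing separate from the file read lets the format handling be exercised without the kernel driver present.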