Invention Grant
US09251118B2 Scheduling computation processes including all-to-all communications (A2A) for pipelined parallel processing among plurality of processor nodes constituting network of n-dimensional space
有权
调度计算过程包括用于在构成n维空间网络的多个处理器节点之间的用于流水线并行处理的全对通信(A2A)
- Patent Title: Scheduling computation processes including all-to-all communications (A2A) for pipelined parallel processing among plurality of processor nodes constituting network of n-dimensional space
- Patent Title (中): 调度计算过程包括用于在构成n维空间网络的多个处理器节点之间的用于流水线并行处理的全对通信(A2A)
-
Application No.: US13510196Application Date: 2010-11-15
-
Publication No.: US09251118B2Publication Date: 2016-02-02
- Inventor: Jun Doi , Yasushi Negishi
- Applicant: Jun Doi , Yasushi Negishi
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Scully, Scott, Murphy & Presser, P.C.
- Agent Jennifer Davis, Esq.
- Priority: JP2009-261113 20091116
- International Application: PCT/JP2010/070314 WO 20101115
- International Announcement: WO2011/059090 WO 20110519
- Main IPC: G06F9/46
- IPC: G06F9/46 ; G06F15/173 ; G06F15/80 ; G06F9/50 ; H04L29/08 ; G06F7/38

Abstract:
Optimally scheduling a plurality of computation processes including all-to-all communications (A2A) among a plurality of nodes (processors) constituting an n-dimensional (a torus or a mesh) network.The plurality of nodes (processors) constituting the network are divided into a communication (computation process) phase (A2A-L) required for all-to-all communications only among a plurality of nodes included in a first subgroup and a communication (computation process) phase (A2A-P) required for all-to-all communications only among a plurality of nodes included in a second subgroup to perform parallel processing with the phases overlapped with each other across a plurality of threads (thread 1, thread 2, thread 3, and thread 4). It is also possible to perform the parallel processing with respect to a plurality of computation processes such as a fast Fourier transform (FFT) and a transpose (T) (internal transpose).
Public/Granted literature
Information query