Invention Grant
- Patent Title: Efficient and more advanced implementation of ring-AllReduce algorithm for distributed parallel deep learning
- Application No.: US16777711
- Application Date: 2020-01-30
- Publication No.: US11520640B2
- Publication Date: 2022-12-06
- Inventor: Liang Han, Yang Jiao
- Applicant: ALIBABA GROUP HOLDING LIMITED
- Applicant Address: KY Grand Cayman
- Assignee: ALIBABA GROUP HOLDING LIMITED
- Current Assignee: ALIBABA GROUP HOLDING LIMITED
- Current Assignee Address: KY Grand Cayman
- Agency: Finnegan, Henderson, Farabow, Garrett & Dunner LLP
- Main IPC: G06F9/52
- IPC: G06F9/52; G06F9/48; G06N20/00

Abstract:
The present disclosure provides a method for syncing data of a computing task across a plurality of groups of computing nodes, each group comprising a set of computing nodes A-D, a set of intra-group interconnects that communicatively couple computing node A with computing nodes B and C and computing node D with computing nodes B and C, and a set of inter-group interconnects that communicatively couple a computing node A of a first group of the plurality of groups with a computing node A of a second group neighboring the first group, a computing node B of the first group with a computing node B of the second group, a computing node C of the first group with the computing node C of the second group, and a computing node D of the first group with a computing node D of the second group, the method comprising: syncing across a first dimension of computing nodes using a first set of ring connections, wherein the first set of ring connections are formed using inter-group and intra-group interconnects that communicatively couple the computing nodes along the first dimension; and broadcasting synced data across a second dimension of computing nodes using a second ring connection.
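The two-step method in the abstract can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the patented implementation: it assumes the nodes are arranged as a rectangular grid, that "syncing" means an element-wise sum, and that the second-dimension step is a second ring pass that combines and redistributes the per-row results so every node ends with the same data. The abstract does not specify the reduction operator or the exact mapping of interconnects to rings, and the names `ring_allreduce` and `sync_2d` are hypothetical.

```python
def ring_allreduce(vectors):
    """Simulate a sum ring all-reduce among n nodes connected in a ring.

    vectors[i] is the list held by node i; all lists have equal length.
    Returns a new list of per-node lists, each equal to the element-wise sum.
    """
    n = len(vectors)
    length = len(vectors[0])
    # Split each vector into n contiguous chunks (some may be empty).
    bounds = [(k * length // n, (k + 1) * length // n) for k in range(n)]
    data = [list(v) for v in vectors]

    # Phase 1, reduce-scatter: at step s, node i sends chunk (i - s) % n
    # to its ring neighbor (i + 1) % n, which adds it to its own copy.
    for s in range(n - 1):
        sends = []
        for i in range(n):
            c = (i - s) % n
            lo, hi = bounds[c]
            sends.append((c, data[i][lo:hi]))  # snapshot: sends are simultaneous
        for i, (c, payload) in enumerate(sends):
            dst = (i + 1) % n
            lo, _ = bounds[c]
            for j, val in enumerate(payload):
                data[dst][lo + j] += val

    # Node i now holds the fully reduced chunk (i + 1) % n.
    # Phase 2, all-gather: circulate the reduced chunks around the ring.
    for s in range(n - 1):
        sends = []
        for i in range(n):
            c = (i + 1 - s) % n
            lo, hi = bounds[c]
            sends.append((c, data[i][lo:hi]))
        for i, (c, payload) in enumerate(sends):
            dst = (i + 1) % n
            lo, hi = bounds[c]
            data[dst][lo:hi] = payload
    return data


def sync_2d(grid):
    """Sync a grid of nodes: ring pass along rows, then along columns.

    grid[r][c] is the vector held by the node at row r, column c
    (the grid layout is an assumption made for this illustration).
    """
    rows, cols = len(grid), len(grid[0])
    # First dimension: one ring per row.
    after_rows = [ring_allreduce(grid[r]) for r in range(rows)]
    # Second dimension: one ring per column, spreading the row results.
    result = [[None] * cols for _ in range(rows)]
    for c in range(cols):
        column = ring_allreduce([after_rows[r][c] for r in range(rows)])
        for r in range(rows):
            result[r][c] = column[r]
    return result


if __name__ == "__main__":
    # Four nodes in a 2x2 grid, each holding a 4-element vector.
    grid = [
        [[1, 2, 3, 4], [10, 20, 30, 40]],
        [[100, 200, 300, 400], [1000, 2000, 3000, 4000]],
    ]
    for row in sync_2d(grid):
        print(row)
```

Splitting the work into per-dimension rings means each node only exchanges data with its ring neighbors, which is the property the interconnect layout in the abstract is designed to provide.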