Invention Grant
- Patent Title: Network switch with integrated gradient aggregation for distributed machine learning
-
Application No.: US17741371Application Date: 2022-05-10
-
Publication No.: US11715040B1Publication Date: 2023-08-01
- Inventor: William Brad Matthews , Puneet Agarwal
- Applicant: Innovium, Inc.
- Applicant Address: US CA San Jose
- Assignee: Innovium, Inc.
- Current Assignee: Innovium, Inc.
- Current Assignee Address: US CA San Jose
- Agency: Shield Intellectual Property PC
- Agent Kirk D. Wong
- Main IPC: G06N20/00
- IPC: G06N20/00 ; H04L67/10 ; H04L47/2441 ; H04L49/00 ; H04L47/32 ; H04L49/25

Abstract:
Distributed machine learning systems and other distributed computing systems are improved by embedding compute logic at the network switch level to perform collective actions, such as reduction operations, on gradients or other data processed by the nodes of the system. The switch is configured to recognize data units that carry data associated with a collective action that needs to be performed by the distributed system, referred to herein as “compute data,” and process that data using a compute subsystem within the switch. The compute subsystem includes a compute engine that is configured to perform various operations on the compute data, such as “reduction” operations, and forward the results back to the compute nodes. The reduction operations may include, for instance, summation, averaging, bitwise operations, and so forth. In this manner, the network switch may take over some or all of the processing of the distributed system during the collective phase.
Information query