Sparse neural network training optimization
Abstract:
An optimized computer architecture for training a neural network includes a system having multiple GPUs. The neural network may be divided into separate portions, and a different portion is assigned to each of the multiple GPUs. Within each GPU, its portion is further divided across multiple training worker threads running on multiple processing cores, and each processing core has lock-free access to a local parameter memory. The local parameter memory of each GPU is separately and individually synchronized with a remote master parameter memory using locked memory access. Each GPU also has a separate set of communication worker threads dedicated to data transfer between the GPU and the remote parameter memory, so that the GPU's training worker threads are not involved in cross-GPU communication.
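The sketch below illustrates the per-GPU thread layout the abstract describes: training worker threads update a local parameter memory without locks, while a dedicated communication worker synchronizes that local memory with a remote master parameter memory under a lock. It is a minimal illustration using plain Python threads and NumPy arrays in place of real GPUs; the names (LocalParams, training_worker, communication_worker, MASTER_PARAMS) and the averaging merge rule are assumptions, not the patented implementation.

    # Minimal sketch: lock-free local updates plus a locked master sync.
    # All names and the merge rule are hypothetical illustrations.
    import threading
    import time
    import numpy as np

    PARAM_SIZE = 1024
    NUM_TRAIN_WORKERS = 4      # training worker threads per simulated "GPU"
    SYNC_INTERVAL = 0.05       # seconds between master synchronizations

    # Remote master parameter memory: shared across GPUs, guarded by a lock.
    MASTER_PARAMS = np.zeros(PARAM_SIZE, dtype=np.float32)
    MASTER_LOCK = threading.Lock()

    class LocalParams:
        """Per-GPU local parameter memory; training workers update it lock-free."""
        def __init__(self):
            self.values = np.zeros(PARAM_SIZE, dtype=np.float32)

    def training_worker(local, stop):
        """Apply simulated gradient updates directly to local memory (no lock)."""
        rng = np.random.default_rng()
        while not stop.is_set():
            grad = rng.standard_normal(PARAM_SIZE).astype(np.float32)
            local.values -= 0.01 * grad   # lock-free, best-effort update
            time.sleep(0.001)

    def communication_worker(local, stop):
        """Synchronize local parameters with the master under a lock, so
        training workers never take part in cross-GPU communication."""
        global MASTER_PARAMS
        while not stop.is_set():
            time.sleep(SYNC_INTERVAL)
            with MASTER_LOCK:
                # Simple averaging merge; the abstract does not specify
                # the actual synchronization rule.
                merged = 0.5 * (MASTER_PARAMS + local.values)
                MASTER_PARAMS = merged
            local.values = merged.copy()

    if __name__ == "__main__":
        local = LocalParams()
        stop = threading.Event()
        threads = [threading.Thread(target=training_worker, args=(local, stop))
                   for _ in range(NUM_TRAIN_WORKERS)]
        threads.append(threading.Thread(target=communication_worker, args=(local, stop)))
        for t in threads:
            t.start()
        time.sleep(0.5)
        stop.set()
        for t in threads:
            t.join()
        print("master parameter norm:", float(np.linalg.norm(MASTER_PARAMS)))

In this sketch each simulated GPU would run its own LocalParams instance and communication worker, so only the communication workers ever contend for the master lock.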