Efficient neural network accelerator dataflows

Invention Grant

US11270197B2 Efficient neural network accelerator dataflows 有权

Please log in to see more content

Patent Title: Efficient neural network accelerator dataflows
Application No.: US16672918

Application Date: 2019-11-04
Publication No.: US11270197B2

Publication Date: 2022-03-08
Inventor: Yakun Shao , Rangharajan Venkatesan , Miaorong Wang , Daniel Smith , William James Dally , Joel Emer , Stephen W. Keckler , Brucek Khailany
Applicant: NVIDIA Corp.
Applicant Address: US CA Santa Clara
Assignee: NVIDIA Corp.
Current Assignee: NVIDIA Corp.
Current Assignee Address: US CA Santa Clara
Agency: Rowan TELS LLC
Main IPC: G06F17/16
IPC: G06F17/16 ; G06N3/063 ; G06F9/38 ; G06N3/08

Efficient neural network accelerator dataflows

Abstract:

A distributed deep neural net (DNN) utilizing a distributed, tile-based architecture includes multiple chips, each with a central processing element, a global memory buffer, and a plurality of additional processing elements. Each additional processing element includes a weight buffer, an activation buffer, and vector multiply-accumulate units to combine, in parallel, the weight values and the activation values using stationary data flows.

Public/Granted literature

US20200293867A1 EFFICIENT NEURAL NETWORK ACCELERATOR DATAFLOWS Public/Granted day:2020-09-17

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F17/00	特别适用于特定功能的数字计算设备或数据处理设备或数据处理方法（信息检索，数据库结构或文件系统结构，G06F 16/00）
G06F17/10	.复杂数学运算的
G06F17/16	..矩阵或向量计算的