Invention Grant
- Patent Title: Efficient neural network accelerator dataflows
- Application No.: US16672918
- Application Date: 2019-11-04
- Publication No.: US11270197B2
- Publication Date: 2022-03-08
- Inventor: Yakun Shao, Rangharajan Venkatesan, Miaorong Wang, Daniel Smith, William James Dally, Joel Emer, Stephen W. Keckler, Brucek Khailany
- Applicant: NVIDIA Corp.
- Applicant Address: US CA Santa Clara
- Assignee: NVIDIA Corp.
- Current Assignee: NVIDIA Corp.
- Current Assignee Address: US CA Santa Clara
- Agency: Rowan TELS LLC
- Main IPC: G06F17/16
- IPC: G06F17/16 ; G06N3/063 ; G06F9/38 ; G06N3/08

Abstract:
A distributed deep neural net (DNN) utilizing a distributed, tile-based architecture includes multiple chips, each with a central processing element, a global memory buffer, and a plurality of additional processing elements. Each additional processing element includes a weight buffer, an activation buffer, and vector multiply-accumulate units to combine, in parallel, the weight values and the activation values using stationary data flows.
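The processing element described in the abstract can be illustrated with a minimal Python sketch. It assumes a weight-stationary flow, which is only one kind of stationary dataflow; the abstract does not say which variants the patent covers. The names `ProcessingElement`, `vector_mac`, `weight_buffer`, and `activation_buffer` are hypothetical and chosen for illustration, and the loop over weight rows models sequentially what the hardware's vector multiply-accumulate units would do in parallel.

```python
def vector_mac(weights, activations, acc=0):
    """Model one vector multiply-accumulate: acc + dot(weights, activations)."""
    if len(weights) != len(activations):
        raise ValueError("weight and activation vectors must have equal length")
    return acc + sum(w * a for w, a in zip(weights, activations))


class ProcessingElement:
    """Hypothetical model of one processing element (assumption, not the patent's design)."""

    def __init__(self, weight_rows):
        # Weight-stationary assumption: weights are loaded once and then
        # stay in the weight buffer while activations stream past them.
        self.weight_buffer = [list(row) for row in weight_rows]

    def run(self, activation_vectors):
        outputs = []
        for activations in activation_vectors:
            # Each incoming vector is staged in the activation buffer.
            activation_buffer = list(activations)
            # One vector MAC per weight row; hardware would run these in parallel.
            outputs.append(
                [vector_mac(row, activation_buffer) for row in self.weight_buffer]
            )
        return outputs


pe = ProcessingElement([[1, 2], [3, 4]])
result = pe.run([[1, 1], [2, 0]])
print(result)  # [[3, 7], [2, 6]]
```

In this sketch the weights are read from the buffer many times but written only once, which is the reuse that a stationary dataflow is meant to exploit.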
Public/Granted literature
- US20200293867A1 EFFICIENT NEURAL NETWORK ACCELERATOR DATAFLOWS Public/Granted day: 2020-09-17