Invention Grant
- Patent Title: Neural network accelerator with parameters resident on chip
-
Application No.: US16569607Application Date: 2019-09-12
-
Publication No.: US11501144B2Publication Date: 2022-11-15
- Inventor: Olivier Temam , Harshit Khaitan , Ravi Narayanaswami , Dong Hyuk Woo
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06N3/063
- IPC: G06N3/063 ; G06N3/04 ; G06F13/00 ; G06F9/38 ; G06F17/16

Abstract:
One embodiment of an accelerator includes a computing unit; a first memory bank for storing input activations and a second memory bank for storing parameters used in performing computations, the second memory bank configured to store a sufficient amount of the neural network parameters on the computing unit to allow for latency below a specified level with throughput above a specified level. The computing unit includes at least one cell comprising at least one multiply accumulate (“MAC”) operator that receives parameters from the second memory bank and performs computations. The computing unit further includes a first traversal unit that provides a control signal to the first memory bank to cause an input activation to be provided to a data bus accessible by the MAC operator. The computing unit performs computations associated with at least one element of a data array, the one or more computations performed by the MAC operator.
Public/Granted literature
- US20200005128A1 NEURAL NETWORK ACCELERATOR WITH PARAMETERS RESIDENT ON CHIP Public/Granted day:2020-01-02
Information query