Invention Grant
- Patent Title: Deep neural network accelerator with fine-grained parallelism discovery
-
Application No.: US15929093Application Date: 2019-01-23
-
Publication No.: US11966835B2Publication Date: 2024-04-23
- Inventor: Ching-En Lee , Yakun Shao , Angshuman Parashar , Joel Emer , Stephen W. Keckler
- Applicant: NVIDIA Corp.
- Applicant Address: US CA Santa Clara
- Assignee: NVIDIA CORP.
- Current Assignee: NVIDIA CORP.
- Current Assignee Address: US CA Santa Clara
- Agency: Rowan TELS LLC
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/04

Abstract:
A sparse convolutional neural network accelerator system that dynamically and efficiently identifies fine-grained parallelism in sparse convolution operations. The system determines matching pairs of non-zero input activations and weights from the compacted input activation and weight arrays utilizing a scalable, dynamic parallelism discovery unit (PDU) that performs a parallel search on the input activation array and the weight array to identify reducible input activation and weight pairs.
Information query