Invention Grant
- Patent Title: Accelerating neural networks with low precision-based multiplication and exploiting sparsity in higher order bits
- Application No.: US16909295
- Application Date: 2020-06-23
- Publication No.: US11714998B2
- Publication Date: 2023-08-01
- Inventors: Avishaii Abuhatzera, Om Ji Omer, Ritwika Chowdhury, Lance Hacking
- Applicant: Intel Corporation
- Applicant Address: Santa Clara, CA, US
- Assignee: INTEL CORPORATION
- Current Assignee: INTEL CORPORATION
- Current Assignee Address: Santa Clara, CA, US
- Agency: Jaffery Watson Mendonsa & Hamilton LLP
- Priority: IN 2041019060, 2020-05-05
- Main IPC: G06N3/063
- IPC: G06N3/063; G06N3/08; G06N3/04; G06N3/088

Abstract:
An apparatus to facilitate accelerating neural networks with low precision-based multiplication and exploiting sparsity in higher order bits is disclosed. The apparatus includes a processor comprising a re-encoder that re-encodes a first input number of signed input numbers, represented in a first precision format as part of a machine learning model, into two signed input numbers of a second precision format, where the first precision format is a higher precision format than the second. The processor further includes a multiply-add circuit that performs operations in the first precision format using the two signed input numbers of the second precision format, and a sparsity hardware circuit that reduces computation on zero values at the multiply-add circuit. The processor executes the machine learning model using the re-encoder, the multiply-add circuit, and the sparsity hardware circuit.