Quantized neural network training and inference
Abstract:
Training neural networks by constructing a neural network model having neurons, each associated with a quantized activation function adapted to output a quantized activation value. The neurons are arranged in layers and connected by connections associated with quantized connection weight functions adapted to output quantized connection weight values. During a training process, a plurality of weight gradients are calculated during backpropagation sub-processes by computing neuron gradients, each being a gradient of an output of a respective quantized activation function in one layer with respect to an input of that quantized activation function. Each neuron gradient is calculated such that when an absolute value of the input is smaller than a positive constant threshold value, the respective neuron gradient is set to a positive constant output value, and when the absolute value of the input is larger than the positive constant threshold value, the neuron gradient is set to zero.
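The gradient rule above is a form of straight-through estimation: the quantized activation is non-differentiable, so its backward pass is replaced by a constant within a bounded input range and zero outside it. Below is a minimal sketch of that rule, assuming a sign-based quantized activation and a PyTorch-style custom autograd function; the class name QuantizedSign, the constant output value of 1, and the threshold value of 1.0 are illustrative assumptions and are not taken from the abstract.

```python
import torch


class QuantizedSign(torch.autograd.Function):
    """Sign quantization with a straight-through-style gradient.

    Forward: quantize the activation input to {-1, 0, +1} via sign.
    Backward: set the neuron gradient to a positive constant (1 here,
    an assumed value) when |input| < threshold, and to zero when
    |input| exceeds the threshold.
    """

    threshold = 1.0  # assumed positive constant threshold value

    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)
        return torch.sign(x)

    @staticmethod
    def backward(ctx, grad_output):
        (x,) = ctx.saved_tensors
        # Neuron gradient: constant 1 where |x| < threshold, 0 elsewhere.
        pass_through = (x.abs() < QuantizedSign.threshold).to(grad_output.dtype)
        return grad_output * pass_through


# Example use inside a layer's forward pass:
#   activations = QuantizedSign.apply(pre_activations)
# Backpropagation through QuantizedSign then yields the thresholded
# constant gradient described in the abstract.
```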