Quantizing neural networks with batch normalization
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network that has one or more batch normalized neural network layers for use by a quantized inference system. One of the methods includes: receiving a first batch of training data; determining batch normalization statistics for the first batch of training data; determining a correction factor from those batch normalization statistics and long-term moving averages of the batch normalization statistics; generating batch normalized weights for a first batch normalized neural network layer by applying the correction factor to the layer's floating point weights; quantizing the batch normalized weights; determining a gradient of an objective function; and updating the floating point weights using the gradient.
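Read procedurally, the abstract describes one training step: compute batch statistics, relate them to their long-term moving averages via a correction factor, fold that correction into the layer's weights, quantize, and then update the underlying floating point weights. The sketch below illustrates how such a step might look for a dense layer. It is a minimal sketch under stated assumptions, not the patent's implementation: the symmetric uniform fake-quantizer, the particular form of the correction factor (the ratio of moving-average to batch standard deviations), and all function and variable names are illustrative choices not taken from the patent.

```python
import numpy as np

def quantize(w, num_bits=8):
    """Uniform symmetric fake quantization (an illustrative choice,
    not necessarily the scheme claimed in the patent)."""
    scale = np.max(np.abs(w)) / (2 ** (num_bits - 1) - 1)
    if scale == 0.0:
        return w
    return np.round(w / scale) * scale

def training_step(x, w, gamma, ema_mean, ema_var,
                  eps=1e-5, momentum=0.99):
    """One hypothetical training step for a batch normalized dense
    layer, following the order of operations in the abstract."""
    # Pre-activation for the current batch; statistics are per output unit.
    y = x @ w
    batch_mean = y.mean(axis=0)
    batch_var = y.var(axis=0)

    # Long-term moving averages of the batch normalization statistics.
    ema_mean = momentum * ema_mean + (1 - momentum) * batch_mean
    ema_var = momentum * ema_var + (1 - momentum) * batch_var

    # Correction factor relating the moving averages to the batch
    # statistics (one plausible definition; assumed, not from the patent).
    correction = np.sqrt(ema_var + eps) / np.sqrt(batch_var + eps)

    # Batch normalized weights: fold the BN scale into the weights
    # using the moving averages, then apply the correction factor.
    w_bn = (gamma / np.sqrt(ema_var + eps)) * correction * w

    # Quantize the batch normalized weights so the forward pass sees
    # quantization effects during training.
    w_q = quantize(w_bn)

    # The gradient of the objective function would be taken through
    # w_q (e.g. with a straight-through estimator) and used to update
    # the floating point weights w; the backward pass is omitted here.
    return w_q, ema_mean, ema_var

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(32, 16))   # batch of 32 inputs
    w = rng.normal(size=(16, 8))    # floating point weights
    gamma = np.ones(8)
    w_q, ema_mean, ema_var = training_step(
        x, w, gamma, ema_mean=np.zeros(8), ema_var=np.ones(8))
    print(w_q.shape)                # (16, 8)
```

With this choice of correction factor, the moving-average fold and the correction together reduce to normalization by the batch statistics, so training behaves like standard batch normalization; as the batch statistics converge toward their moving averages the correction tends to one, and the quantized weights stabilize toward the folded weights a quantized inference system would use.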