Invention Grant
- Patent Title: Quantizing neural networks with batch normalization
-
Application No.: US16262772Application Date: 2019-01-30
-
Publication No.: US12033067B2Publication Date: 2024-07-09
- Inventor: Suharsh Vikram Sivakumar , Raghuraman Krishnamoorthi
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06F7/483 ; G06N3/04

Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network that has one or more batch normalized neural network layers for use by a quantized inference system. One of the methods includes receiving a first batch of training data; determining batch normalization statistics for the first batch of training data; determining a correction factor from the batch normalization statistics for the first batch of training data and the long-term moving averages of the batch normalization statistics; generating batch normalized weights from the floating point weights for the batch normalized first neural network layer, comprising applying the correction factor to the floating point weights of the batch normalized first neural network layer; quantizing the batch normalized weights; determining a gradient of an objective function; and updating the floating point weights using the gradient.
Information query