Invention Grant
- Patent Title: Dynamic quantization of neural networks
- Application No.: US15857438
- Application Date: 2017-12-28
- Publication No.: US11755901B2
- Publication Date: 2023-09-12
- Inventor: Michael E. Deisher
- Applicant: Intel Corporation
- Applicant Address: US CA Santa Clara
- Assignee: Intel Corporation
- Current Assignee: Intel Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: HANLEY, FLIGHT AND ZIMMERMAN, LLC
- Main IPC: G06F5/01
- IPC: G06F5/01 ; G06N3/08 ; G06F7/57 ; G06F7/544 ; G06N3/063 ; G06N3/02 ; G06N3/044 ; G06N3/045 ; G06N3/048 ; G06F7/02

Abstract:
An apparatus for applying dynamic quantization to a neural network is described herein. The apparatus includes a scaling unit and a quantizing unit. The scaling unit is to calculate initial desired scale factors for a plurality of inputs, weights, and a bias, and to apply the input scale factor to a summation node. The scaling unit is also to determine a scale factor for a multiplication node based on the desired scale factors of the inputs, and to select a scale factor for an activation function and an output node. The quantizing unit is to dynamically requantize the neural network by traversing a graph of the neural network.
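The scale-factor bookkeeping named in the abstract can be illustrated with a minimal sketch of a single affine node. This is not the patented method: it assumes a common fixed-point convention in which each scale factor maps the largest magnitude onto a signed integer range, the multiplication node carries the product of the input and weight scales, and the bias is quantized at that same product scale so it can be added at the summation node. The function names, the bit widths, and the choice to convert back to floating point at the output are all hypothetical, and graph traversal and requantization are left out.

```python
def scale_factor(values, num_bits):
    """Assumed rule: map the largest magnitude onto the signed integer range."""
    max_abs = max(abs(v) for v in values)
    q_max = 2 ** (num_bits - 1) - 1
    return q_max / max_abs if max_abs > 0 else 1.0


def quantize(values, scale, num_bits):
    """Scale, round, and clamp each value to the signed integer range."""
    q_max = 2 ** (num_bits - 1) - 1
    return [max(-q_max, min(q_max, round(v * scale))) for v in values]


def quantized_affine(inputs, weights, bias, in_bits=16, w_bits=8):
    """Hypothetical single node: sum(w_i * x_i) + bias in integer arithmetic."""
    # Initial desired scale factors for the inputs and the weights.
    s_in = scale_factor(inputs, in_bits)
    s_w = scale_factor(weights, w_bits)

    # Multiplication node: an integer product carries the product of scales.
    s_mul = s_in * s_w
    q_in = quantize(inputs, s_in, in_bits)
    q_w = quantize(weights, s_w, w_bits)

    # Summation node: the bias is quantized at the product scale so that
    # every term being added shares one scale.
    q_bias = round(bias * s_mul)
    acc = sum(x * w for x, w in zip(q_in, q_w)) + q_bias

    # Output node: this sketch simply divides the scale back out.
    return acc / s_mul


if __name__ == "__main__":
    approx = quantized_affine([0.5, -1.0, 0.25], [0.2, -0.4, 0.1], 0.05)
    exact = 0.5 * 0.2 + (-1.0) * (-0.4) + 0.25 * 0.1 + 0.05
    print(f"quantized: {approx:.5f}  exact: {exact:.5f}")
```

Keeping every term at the summation node on one shared scale is what lets the accumulation run in integers; a real implementation would also have to pick an output scale that keeps the accumulator within its bit width.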
Public/Granted literature
- US20190042935A1 DYNAMIC QUANTIZATION OF NEURAL NETWORKS Public/Granted day: 2019-02-07