Invention Grant
- Patent Title: Statistics-aware weight quantization
-
Application No.: US16007984Application Date: 2018-06-13
-
Publication No.: US11551077B2Publication Date: 2023-01-10
- Inventor: Zhuo Wang , Jungwook Choi , Kailash Gopalakrishnan , Pierce I-Jen Chuang
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Amin, Turocy & Watson, LLP
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/04

Abstract:
Techniques for statistics-aware weight quantization are presented. To facilitate reducing the bit precision of weights, for a set of weights, a quantizer management component can estimate a quantization scale value to apply to a weight as a linear or non-linear function of the mean of a square of a weight value of the weight and the mean of an absolute value of the weight value, wherein the quantization scale value is determined to have a smaller quantization error than all, or at least almost all, other quantization errors associated with other quantization scale values. A quantizer component applies the quantization scale value to symmetrically and/or uniformly quantize weights of a layer of the set of weights to generate quantized weights, the weights being quantized using rounding. The respective quantized weights can be used to facilitate training and inference of a deep learning system.
Public/Granted literature
- US20190385050A1 STATISTICS-AWARE WEIGHT QUANTIZATION Public/Granted day:2019-12-19
Information query