Statistics-aware weight quantization

Invention Grant

US11551077B2 Statistics-aware weight quantization (Active)

Patent Title: Statistics-aware weight quantization
Application No.: US16007984

Application Date: 2018-06-13
Publication No.: US11551077B2

Publication Date: 2023-01-10
Inventor: Zhuo Wang, Jungwook Choi, Kailash Gopalakrishnan, Pierce I-Jen Chuang
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agency: Amin, Turocy & Watson, LLP
Main IPC: G06N3/08
IPC: G06N3/08; G06N3/04

Abstract:

Techniques for statistics-aware weight quantization are presented. To facilitate reducing the bit precision of weights, for a set of weights, a quantizer management component can estimate a quantization scale value to apply to a weight as a linear or non-linear function of the mean of a square of a weight value of the weight and the mean of an absolute value of the weight value, wherein the quantization scale value is determined to have a smaller quantization error than all, or at least almost all, other quantization errors associated with other quantization scale values. A quantizer component applies the quantization scale value to symmetrically and/or uniformly quantize weights of a layer of the set of weights to generate quantized weights, the weights being quantized using rounding. The respective quantized weights can be used to facilitate training and inference of a deep learning system.
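The procedure summarized in the abstract can be illustrated with a minimal NumPy sketch. It is an assumption-laden illustration, not the patented implementation: the function names `estimate_scale` and `quantize_symmetric`, the linear form of the scale estimate, and the coefficients `c1` and `c2` are hypothetical placeholders. The abstract only states that the scale is a linear or non-linear function of the mean of the squared weights and the mean of the absolute weights.

```python
import numpy as np


def estimate_scale(weights, c1=3.0, c2=-2.0):
    """Estimate a quantization scale from two weight statistics.

    Assumption: the scale is modeled as a linear combination of
    sqrt(mean(w^2)) and mean(|w|). The coefficients c1 and c2 are
    illustrative placeholders, not values taken from the patent.
    """
    w = np.asarray(weights, dtype=np.float64)
    mean_sq = np.mean(w ** 2)        # mean of the squared weight values
    mean_abs = np.mean(np.abs(w))    # mean of the absolute weight values
    return c1 * np.sqrt(mean_sq) + c2 * mean_abs


def quantize_symmetric(weights, scale, num_bits=4):
    """Symmetric, uniform quantization with rounding.

    Weights are mapped onto 2 * n + 1 evenly spaced levels in
    [-scale, +scale], where n = 2**(num_bits - 1) - 1.
    """
    if scale <= 0:
        raise ValueError("scale must be positive")
    if num_bits < 2:
        raise ValueError("num_bits must be at least 2")
    w = np.asarray(weights, dtype=np.float64)
    n = 2 ** (num_bits - 1) - 1      # number of positive levels
    step = scale / n                 # uniform spacing between levels
    # Round to the nearest level index, then clip to the symmetric range.
    idx = np.clip(np.round(w / step), -n, n)
    return idx * step


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    layer_weights = rng.normal(0.0, 0.05, size=1000)  # synthetic layer
    alpha = estimate_scale(layer_weights)
    quantized = quantize_symmetric(layer_weights, alpha, num_bits=4)
    mse = np.mean((layer_weights - quantized) ** 2)
    print("scale:", alpha, "quantization MSE:", mse)
```

In practice the coefficients would be chosen, per bit width, so that the estimated scale yields a quantization error close to the minimum over candidate scales. The values used here are not tuned for that purpose.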

Public/Granted literature

US20190385050A1 STATISTICS-AWARE WEIGHT QUANTIZATION Publication/grant date: 2019-12-19

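IPC classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	.using neural network models
G06N3/08	..Learning methods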