-
1.
公开(公告)号:US20240046086A1
公开(公告)日:2024-02-08
申请号:US18269445
申请日:2021-12-13
Applicant: TSINGHUA UNIVERSITY
Inventor: Huaqiang WU , Qingtian ZHANG , Lingjun DAI
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: Disclosed are a quantization method and quantization apparatus for a weight of a neural network, and a storage medium. The neural network is implemented on the basis of a crossbar-enabled analog computing-in-memory (CACIM) system, and the quantization method includes: acquiring a distribution characteristic of a weight; and determining, according to the distribution characteristic of the weight, an initial quantization parameter for quantizing the weight to reduce a quantization error in quantizing the weight. The quantization method provided by the embodiments of the present disclosure does not pre-define the quantization method used, but determines the quantization parameter used for quantizing the weight according to the distribution characteristic of the weight to reduce the quantization error, so that the effect of the neural network model is better under the same mapping overhead, and the mapping overhead is smaller under the same effect of the neural network model.