Quantizing trained neural networks with removal of normalization
Abstract:
Various embodiments provide for quantizing a trained neural network while removing normalization for at least one layer of the quantized neural network, such as a quantized multiple fan-in layer (e.g., an element-wise add or sum layer).
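
The abstract does not say how normalization is removed, so the following is only a minimal sketch under stated assumptions. It assumes that "normalization" means rescaling the inputs of a multiple fan-in layer to a common quantization scale before they are added, and that it can be avoided by quantizing every input branch with one shared scale. All function names, the symmetric 8-bit scheme, and the sample values are hypothetical and are not taken from the source.

```python
# Illustrative sketch only: the scheme and all names are assumptions.

def scale_for(values, bits=8):
    """Symmetric per-tensor scale that maps the largest magnitude to qmax."""
    peak = max(abs(v) for v in values)
    return peak / (2 ** (bits - 1) - 1) if peak else 1.0

def quantize(values, scale, bits=8):
    """Round to the nearest integer step and clamp to the signed range."""
    qmin, qmax = -(2 ** (bits - 1)), 2 ** (bits - 1) - 1
    return [max(qmin, min(qmax, round(v / scale))) for v in values]

def add_with_normalization(qa, sa, qb, sb, s_out):
    """Element-wise add when branches use different scales:
    each input is first rescaled (normalized) to the output scale."""
    return [round(x * sa / s_out) + round(y * sb / s_out)
            for x, y in zip(qa, qb)]

def add_without_normalization(qa, qb):
    """Element-wise add when both branches already share one scale:
    the integers are summed directly (in a wider accumulator)."""
    return [x + y for x, y in zip(qa, qb)]

# Hypothetical float activations feeding an element-wise add layer.
branch_a = [0.5, -1.0, 0.25]
branch_b = [2.0, 4.0, -3.0]
true_sum = [a + b for a, b in zip(branch_a, branch_b)]

# Path 1: per-branch scales, so the add layer must normalize its inputs.
sa, sb = scale_for(branch_a), scale_for(branch_b)
s_out = max(sa, sb)
normalized = add_with_normalization(
    quantize(branch_a, sa), sa, quantize(branch_b, sb), sb, s_out)
approx_normalized = [q * s_out for q in normalized]

# Path 2: one shared scale chosen up front, so no normalization is needed.
shared = max(sa, sb)
direct = add_without_normalization(
    quantize(branch_a, shared), quantize(branch_b, shared))
approx_direct = [q * shared for q in direct]
```

In this sketch the shared scale costs the smaller-range branch some resolution, which is the usual trade-off when the rescaling step is dropped from the add layer.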