Invention Publication
- Patent Title: AUTOMATIC QUANTIZATION OF A FLOATING POINT MODEL
-
Application No.: US18331660Application Date: 2023-06-08
-
Publication No.: US20240053959A1Publication Date: 2024-02-15
- Inventor: Denys Makoviichuk , Jiazhuo Wang , Yang Wen
- Applicant: Snap Inc.
- Applicant Address: US CA Santa Monica
- Assignee: Snap Inc.
- Current Assignee: Snap Inc.
- Current Assignee Address: US CA Santa Monica
- Main IPC: G06F7/483
- IPC: G06F7/483

Abstract:
Aspects of the present disclosure involve a system comprising a computer-readable storage medium storing a program and method for automatic quantization of a floating point model. The program and method provide for providing a floating point model to an automatic quantization library, the floating point model being configured to represent a neural network, and the automatic quantization library being configured to generate a first quantized model based on the floating point model; providing a function to the automatic quantization library, the function being configured to run a forward pass on a given dataset for the floating point model; causing the automatic quantization library to generate the first quantized model based on the floating point model; causing the automatic quantization library to calibrate the first quantized model by running the first quantized model on the function; and converting the calibrated first quantized model to a second quantized model.
Information query