AUTOMATIC QUANTIZATION OF A FLOATING POINT MODEL

Invention Publication

US20240053959A1 AUTOMATIC QUANTIZATION OF A FLOATING POINT MODEL Under examination - Published

Patent Title: AUTOMATIC QUANTIZATION OF A FLOATING POINT MODEL
Application No.: US18331660

Application Date: 2023-06-08
Publication No.: US20240053959A1

Publication Date: 2024-02-15
Inventor: Denys Makoviichuk, Jiazhuo Wang, Yang Wen
Applicant: Snap Inc.
Applicant Address: US CA Santa Monica
Assignee: Snap Inc.
Current Assignee: Snap Inc.
Current Assignee Address: US CA Santa Monica
Main IPC: G06F7/483
IPC: G06F7/483

Abstract:

Aspects of the present disclosure involve a system comprising a computer-readable storage medium storing a program and method for automatic quantization of a floating point model. The program and method include: providing a floating point model to an automatic quantization library, the floating point model being configured to represent a neural network, and the automatic quantization library being configured to generate a first quantized model based on the floating point model; providing a function to the automatic quantization library, the function being configured to run a forward pass on a given dataset for the floating point model; causing the automatic quantization library to generate the first quantized model based on the floating point model; causing the automatic quantization library to calibrate the first quantized model by running the first quantized model on the function; and converting the calibrated first quantized model to a second quantized model.
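
The sequence in the abstract (provide a floating point model and a forward-pass function, generate a first quantized model, calibrate it, convert it to a second quantized model) can be sketched roughly as below. This is a minimal illustration in plain Python, not the implementation disclosed in the application. Every name in it (FloatLinear, ObservedLinear, Int8Linear, AutoQuantizer, run_forward_pass, CALIBRATION_DATA) is hypothetical, and the choices of a single linear layer, min/max observers, and symmetric int8 quantization are assumptions made only for the example.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


def quantize(value: float, scale: float) -> int:
    """Symmetric int8 quantization: round(value / scale), clamped to [-127, 127]."""
    return max(-127, min(127, round(value / scale)))


@dataclass
class FloatLinear:
    """Toy floating point model y = w . x + b (stand-in for a neural network)."""
    weights: List[float]
    bias: float

    def __call__(self, x: Sequence[float]) -> float:
        return sum(w * xi for w, xi in zip(self.weights, x)) + self.bias


@dataclass
class ObservedLinear:
    """Assumed form of the first quantized model: float math plus input observers."""
    float_model: FloatLinear
    in_min: float = float("inf")
    in_max: float = float("-inf")

    def __call__(self, x: Sequence[float]) -> float:
        # Record the input range seen so far, then defer to the float model.
        self.in_min = min(self.in_min, min(x))
        self.in_max = max(self.in_max, max(x))
        return self.float_model(x)


@dataclass
class Int8Linear:
    """Assumed form of the second quantized model: int8 weights and inputs."""
    q_weights: List[int]
    w_scale: float
    in_scale: float
    bias: float

    def __call__(self, x: Sequence[float]) -> float:
        q_x = [quantize(xi, self.in_scale) for xi in x]
        acc = sum(qw * qx for qw, qx in zip(self.q_weights, q_x))  # integer accumulate
        return acc * self.w_scale * self.in_scale + self.bias  # dequantize the result


class AutoQuantizer:
    """Hypothetical stand-in for the automatic quantization library."""

    def __init__(self, float_model: FloatLinear, forward_fn: Callable[[Callable], None]):
        self.float_model = float_model
        self.forward_fn = forward_fn

    def generate(self) -> ObservedLinear:
        # Generate the first quantized model from the floating point model.
        return ObservedLinear(self.float_model)

    def calibrate(self, observed: ObservedLinear) -> ObservedLinear:
        # Run the first quantized model on the user-supplied forward-pass function.
        self.forward_fn(observed)
        return observed

    def convert(self, observed: ObservedLinear) -> Int8Linear:
        # Derive scales from the weights and the calibrated input range.
        weights = observed.float_model.weights
        w_scale = (max(abs(w) for w in weights) / 127) or 1.0
        in_scale = (max(abs(observed.in_min), abs(observed.in_max)) / 127) or 1.0
        q_weights = [quantize(w, w_scale) for w in weights]
        return Int8Linear(q_weights, w_scale, in_scale, observed.float_model.bias)


CALIBRATION_DATA = [[0.5, -1.0], [2.0, 0.25], [-1.5, 1.0]]  # made-up samples


def run_forward_pass(model: Callable) -> None:
    """The 'function' from the abstract: a forward pass over a given dataset."""
    for sample in CALIBRATION_DATA:
        model(sample)


float_model = FloatLinear(weights=[0.8, -0.4], bias=0.1)
library = AutoQuantizer(float_model, run_forward_pass)
first_quantized = library.calibrate(library.generate())
second_quantized = library.convert(first_quantized)
```

In this sketch the forward-pass function takes the model as its argument, so the same function can drive either the floating point model or the first quantized model during calibration. That signature is a design assumption of the example, not something the abstract specifies.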

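IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: G06N)
G06F7/00	Methods or arrangements for processing data by operating upon the order or content of the data handled (logic circuits: H03K19/00)
G06F7/38	.Methods or arrangements for performing computations using exclusively denominational number representation, e.g. using binary, ternary, decimal representation
G06F7/48	..Using non-contact-making devices, e.g. tube, solid state device; using unspecified devices
G06F7/483	...Computations with numbers represented by a non-linear combination of denominational numbers, e.g. rational numbers, logarithmic number system or floating-point numbers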