Method and apparatus for data-free post-training network quantization and generating synthetic data based on a pre-trained machine learning model

Invention Grant

US12154030B2 Method and apparatus for data-free post-training network quantization and generating synthetic data based on a pre-trained machine learning model (Status: In force)


Patent Title: Method and apparatus for data-free post-training network quantization and generating synthetic data based on a pre-trained machine learning model
Application No.: US17096734

Application Date: 2020-11-12
Publication No.: US12154030B2

Publication Date: 2024-11-26
Inventor: Yoo Jin Choi, Mostafa El-Khamy, Jungwon Lee
Applicant: Samsung Electronics Co., Ltd.
Applicant Address: KR Suwon-si
Assignee: Samsung Electronics Co., Ltd.
Current Assignee: Samsung Electronics Co., Ltd.
Current Assignee Address: KR Suwon-si
Agency: Lewis Roca Rothgerber Christie LLP
Main IPC: G06N3/08
IPC: G06N3/08; G06F17/18; G06F18/2113; G06F18/214; G06F18/22; G06N3/045; G06N7/01


Abstract:

A method for training a generator, by a generator training system including a processor and memory, includes: extracting training statistical characteristics from a batch normalization layer of a pre-trained model, the training statistical characteristics including a training mean $\mu$ and a training variance $\sigma^2$; initializing a generator configured with generator parameters; generating a batch of synthetic data using the generator; supplying the batch of synthetic data to the pre-trained model; measuring statistical characteristics of activations at the batch normalization layer and at the output of the pre-trained model in response to the batch of synthetic data, the statistical characteristics including a measured mean $\hat{\mu}_\psi$ and a measured variance $\hat{\sigma}_\psi^2$; computing a training loss in accordance with a loss function $L_\psi$ based on $\mu$, $\sigma^2$, $\hat{\mu}_\psi$, and $\hat{\sigma}_\psi^2$; and iteratively updating the generator parameters in accordance with the training loss until a training completion condition is met to compute the generator.
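The loss-computation step of the abstract can be illustrated with a small sketch. The abstract does not state the form of the loss function, so the squared-difference penalty below is an assumption chosen only for illustration, and the names `batch_stats` and `bn_stat_loss` are hypothetical. The sketch covers a single channel of one batch normalization layer and omits the term measured at the output of the pre-trained model, the generator itself, and the parameter updates.

```python
from statistics import fmean, pvariance


def batch_stats(activations):
    """Measure the mean and (population) variance of one channel's
    activations over a batch of synthetic data."""
    mean = fmean(activations)
    var = pvariance(activations, mu=mean)
    return mean, var


def bn_stat_loss(stored_mean, stored_var, measured_mean, measured_var):
    """Hypothetical loss (an assumption, not the patented formula):
    squared gap between the statistics stored in the batch normalization
    layer and the statistics measured on the synthetic batch."""
    return (measured_mean - stored_mean) ** 2 + (measured_var - stored_var) ** 2


# Activations produced at one batch normalization channel by a synthetic batch.
activations = [0.0, 2.0, 4.0, 6.0]
mu_hat, var_hat = batch_stats(activations)

# Stored statistics that match the batch give zero loss.
loss_matched = bn_stat_loss(3.0, 5.0, mu_hat, var_hat)
# Stored statistics that differ give a positive loss to be minimized.
loss_mismatched = bn_stat_loss(0.0, 1.0, mu_hat, var_hat)
```

In a full training loop, a loss of this kind would be summed over channels and layers and minimized with respect to the generator parameters until the completion condition is met.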

Public/Granted literature

US20220083855A1 METHOD AND APPARATUS FOR DATA-FREE POST-TRAINING NETWORK QUANTIZATION AND GENERATING SYNTHETIC DATA BASED ON A PRE-TRAINED MACHINE LEARNING MODEL
Public/Granted day: 2022-03-17


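IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	.Using neural network models
G06N3/08	..Learning methods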