Optimized quantization for reduced resolution neural networks

Invention Grant

US11601134B2 Optimized quantization for reduced resolution neural networks 有权

Please log in to see more content

Patent Title: Optimized quantization for reduced resolution neural networks
Application No.: US16739484

Application Date: 2020-01-10
Publication No.: US11601134B2

Publication Date: 2023-03-07
Inventor: Akshay Malhotra , Thomas Rocznik , Christian Peters
Applicant: Robert Bosch GmbH
Applicant Address: DE Stuttgart
Assignee: Robert Bosch GmbH
Current Assignee: Robert Bosch GmbH
Current Assignee Address: DE Stuttgart
Agency: Brooks Kushman P.C.
Main IPC: G06N3/10
IPC: G06N3/10 ; G06N3/08 ; H03M7/24 ; G06F17/18 ; G06N20/00 ; G06N5/046 ; G06F17/16 ; G06F17/15

Optimized quantization for reduced resolution neural networks

Abstract:

A system and method for generating and using fixed-point operations for neural networks includes converting floating-point weighting factors into fixed-point weighting factors using a scaling factor. The scaling factor is defined to minimize a cost function and the scaling factor is derived from a set of multiples of a predetermined base. The set of possible scaling function is defined to reduce the computational effort for evaluating the cost function for each of a number of possible scaling factors. The system and method may be implemented in one or more controllers that are programmed to execute the logic.

Public/Granted literature

US20210218414A1 OPTIMIZED QUANTIZATION FOR REDUCED RESOLUTION NEURAL NETWORKS Public/Granted day:2021-07-15

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/10	..在通用计算机上的仿真