Decomposition of weight tensors in network with value quantization
Abstract:
Some embodiments provide a method for training parameters of a network. The method receives a machine-trained (MT) network with multiple layers of computation nodes. Each computation node of a set of the layers computes an output value based on a set of input values and a set of trained weight values. A first layer of the MT network includes a first number of filters. The method replaces the first layer with (i) a second layer having a second number of filters that is less than the first number of filters and (ii) a third layer having the first number of filters. Output values of computation nodes of the second layer are quantized, and the third layer uses the quantized output values of the second layer as its input values.
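The sketch below is a minimal illustration, assuming PyTorch, of the kind of layer replacement the abstract describes: a layer with N filters is replaced by a narrower layer with M < N filters whose outputs are quantized, followed by a layer that restores the original N filters. The class and function names, kernel sizes, and the fake-quantization scheme are illustrative assumptions, not the patented implementation.

# Illustrative sketch only; names, kernel sizes, and quantization scheme are assumptions.
import torch
import torch.nn as nn

class FakeQuantize(nn.Module):
    """Quantize activations to a fixed number of levels (straight-through)."""
    def __init__(self, num_bits=4, max_val=6.0):
        super().__init__()
        self.levels = 2 ** num_bits - 1
        self.max_val = max_val

    def forward(self, x):
        x = torch.clamp(x, 0.0, self.max_val)
        scale = self.max_val / self.levels
        q = torch.round(x / scale) * scale
        # Straight-through estimator: quantized values in the forward pass,
        # identity gradient in the backward pass.
        return x + (q - x).detach()

def decompose_layer(first_layer: nn.Conv2d, reduced_filters: int) -> nn.Sequential:
    """Replace one conv layer having N filters with:
       (i) a conv layer with M < N filters whose outputs are quantized, and
       (ii) a 1x1 conv layer restoring the original N filters."""
    n_filters = first_layer.out_channels
    assert reduced_filters < n_filters
    second = nn.Conv2d(first_layer.in_channels, reduced_filters,
                       kernel_size=first_layer.kernel_size,
                       stride=first_layer.stride,
                       padding=first_layer.padding)
    third = nn.Conv2d(reduced_filters, n_filters, kernel_size=1)
    return nn.Sequential(second, FakeQuantize(), third)

# Example: replace a 64-filter layer with a 16-filter layer plus a 1x1 expansion.
original = nn.Conv2d(3, 64, kernel_size=3, padding=1)
replacement = decompose_layer(original, reduced_filters=16)
out = replacement(torch.randn(1, 3, 32, 32))
print(out.shape)  # torch.Size([1, 64, 32, 32])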