Differential bit width neural architecture search

Invention Grant

US11604960B2 Differential bit width neural architecture search (Active)

Patent Title: Differential bit width neural architecture search
Application No.: US16356928

Application Date: 2019-03-18
Publication No.: US11604960B2

Publication Date: 2023-03-14
Inventor: Kalin Ovtcharov, Eric S. Chung, Vahideh Akhlaghi, Ritchie Zhao
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: Newport IP, LLC
Agent: Leonard J. Hope
Main IPC: G06N3/063
IPC: G06N3/063; G06N3/04; G06N3/084

Abstract:

Machine learning is utilized to learn an optimized quantization configuration for an artificial neural network (ANN). For example, an ANN can be utilized to learn an optimal bit width for quantizing weights for layers of the ANN. The ANN can also be utilized to learn an optimal bit width for quantizing activation values for the layers of the ANN. Once the bit widths have been learned, they can be utilized at inference time to improve the performance of the ANN by quantizing the weights and activation values of the layers of the ANN.
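
The abstract does not spell out how the bit widths are learned. As a rough illustration only, the sketch below assumes a softmax relaxation over a small set of candidate bit widths, in the spirit of differentiable architecture search, combined with symmetric uniform quantization. The function names, the candidate set, and the logit values are all hypothetical and are not taken from the patent.

```python
import math

def quantize(values, bit_width):
    """Symmetric uniform quantization of a list of floats to `bit_width` bits."""
    if bit_width < 2:
        raise ValueError("bit_width must be at least 2 for a signed grid")
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return list(values)
    levels = 2 ** (bit_width - 1) - 1   # largest positive integer code
    scale = max_abs / levels            # step size between adjacent grid points
    return [round(v / scale) * scale for v in values]

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def mixed_quantize(values, candidate_bits, logits):
    """Blend the quantized copies using softmax weights over candidate bit widths.

    During training, the logits would be updated by gradient descent
    (assumption: this sketch shows only the forward computation).
    """
    probs = softmax(logits)
    out = [0.0] * len(values)
    for p, bits in zip(probs, candidate_bits):
        q = quantize(values, bits)
        out = [o + p * qi for o, qi in zip(out, q)]
    return out

def select_bit_width(candidate_bits, logits):
    """After training, keep the candidate bit width with the largest logit."""
    best = max(range(len(logits)), key=lambda i: logits[i])
    return candidate_bits[best]

# Hypothetical layer weights and hypothetical learned logits.
weights = [0.31, -0.72, 0.05, 0.98, -0.44]
candidate_bits = [2, 4, 8]
logits = [0.1, 1.5, 0.3]

blended = mixed_quantize(weights, candidate_bits, logits)  # training-time view
chosen = select_bit_width(candidate_bits, logits)          # inference-time choice
quantized = quantize(weights, chosen)                      # weights used at inference
```

The same pattern could be applied per layer to activation values as well as weights, with a separate set of logits for each. Once a bit width is selected, only the single quantized copy is needed at inference time.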

Public/Granted literature

US20200302269A1 DIFFERENTIAL BIT WIDTH NEURAL ARCHITECTURE SEARCH Public/Granted day: 2020-09-24

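IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computing arrangements based on specific computational models
G06N3/00	Computing arrangements based on biological models
G06N3/02	. Using neural network models
G06N3/06	.. Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
G06N3/063	... Using electronic means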