Method and apparatus for compressing neural network

Invention Grant

US11379723B2 Method and apparatus for compressing neural network (In force)

Patent Title: Method and apparatus for compressing neural network
Application No.: US16143719

Application Date: 2018-09-27
Publication No.: US11379723B2

Publication Date: 2022-07-05
Inventor: Gang Zhang
Applicant: Baidu Online Network Technology (Beijing) Co., Ltd.
Applicant Address: CN Beijing
Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
Current Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
Current Assignee Address: CN Beijing
Agency: Nixon Peabody LLP
Priority: CN201711473963.3 20171229
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N3/10 ; G06N3/04 ; G06N20/00

Abstract:

A method and apparatus for compressing a neural network are provided. A specific embodiment of the method includes: acquiring a trained neural network to be compressed; selecting at least one layer from the layers of the neural network as a to-be-compressed layer; performing the following processing steps sequentially on each to-be-compressed layer, in descending order of the layers' level numbers: determining a pruning ratio based on the total number of parameters included in the to-be-compressed layer, selecting parameters for pruning from the parameters included in the to-be-compressed layer based on the pruning ratio and a parameter value threshold, and training the pruned neural network on preset training samples using a machine learning method; and determining the neural network obtained after performing the processing steps on every selected to-be-compressed layer as the compressed neural network, and storing the compressed neural network.
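
The loop described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the patented implementation: the names `pruning_ratio`, `select_for_pruning`, `compress`, and the `retrain` callback are hypothetical, the size-to-ratio rule is invented for the example, and the way the ratio and threshold are combined (prune the smallest-magnitude parameters up to the ratio, but only those below the threshold) is one assumed reading of the abstract.

```python
from typing import Callable, Dict, Iterable, List, Set

Network = Dict[int, List[float]]  # layer level number -> flat parameter list


def pruning_ratio(num_params: int) -> float:
    """Hypothetical rule: layers with more parameters get a larger ratio."""
    return 0.5 if num_params >= 8 else 0.25


def select_for_pruning(params: List[float], ratio: float,
                       threshold: float) -> List[int]:
    """Return indices of the parameters to prune.

    Assumed interpretation: at most ratio * len(params) parameters are
    pruned, smallest magnitude first, and only if below the threshold.
    """
    budget = int(ratio * len(params))
    by_magnitude = sorted(range(len(params)), key=lambda i: abs(params[i]))
    return [i for i in by_magnitude[:budget] if abs(params[i]) < threshold]


def compress(network: Network, layers_to_compress: Iterable[int],
             threshold: float,
             retrain: Callable[[Network, Dict[int, Set[int]]], None]) -> Network:
    """Prune the selected layers one at a time, highest level number first."""
    compressed = {level: list(p) for level, p in network.items()}
    masks: Dict[int, Set[int]] = {}
    for level in sorted(layers_to_compress, reverse=True):
        params = compressed[level]
        ratio = pruning_ratio(len(params))
        pruned = select_for_pruning(params, ratio, threshold)
        for i in pruned:
            params[i] = 0.0  # pruning modeled here as zeroing the parameter
        masks[level] = set(pruned)
        # Placeholder for retraining on the preset training samples;
        # the masks tell the trainer which parameters must stay pruned.
        retrain(compressed, masks)
    return compressed


if __name__ == "__main__":
    toy = {
        1: [0.9, -0.05, 0.4, 0.01],
        2: [0.02, -0.8, 0.03, 0.6, -0.01, 0.7, 0.2, -0.04],
    }
    result = compress(toy, [1, 2], threshold=0.1, retrain=lambda net, m: None)
    print(result)
```

Retraining after each layer, rather than once at the end, gives the remaining parameters a chance to compensate for each pruning step before the next layer is touched. In a real system the `retrain` callback would run the training procedure and the result would then be stored.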

Public/Granted literature

US20190205759A1 METHOD AND APPARATUS FOR COMPRESSING NEURAL NETWORK, Public/Granted day: 2019-07-04

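IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	.Using neural network models
G06N3/08	..Learning methods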