Invention Grant
- Patent Title: Method and apparatus for compressing neural network
-
Application No.: US16143719Application Date: 2018-09-27
-
Publication No.: US11379723B2Publication Date: 2022-07-05
- Inventor: Gang Zhang
- Applicant: Baidu Online Network Technology (Beijing) Co., Ltd.
- Applicant Address: CN Beijing
- Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
- Current Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
- Current Assignee Address: CN Beijing
- Agency: Nixon Peabody LLP
- Priority: CN201711473963.3 20171229
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/10 ; G06N3/04 ; G06N20/00

Abstract:
A method and apparatus for compressing a neural network are provided. A specific embodiment of the method includes: acquiring a to-be-compressed trained neural network; selecting at least one layer from layers of the neural network as a to-be-compressed layer; performing following processing steps sequentially on each of the to-be-compressed layers in descending order of the number of level of the to-be-compressed layer: determining a pruning ratio based on a total number of parameters included in the to-be-compressed layer, selecting a parameter for pruning from the parameters included in the to-be-compressed layer based on the pruning ratio and a parameter value threshold, and training the pruned neural network based on a preset training sample using a machine learning method; and determining the neural network obtained after performing the processing steps on the selected at least one to-be-compressed layer as a compressed neural network, and storing the compressed neural network.
Public/Granted literature
- US20190205759A1 METHOD AND APPARATUS FOR COMPRESSING NEURAL NETWORK Public/Granted day:2019-07-04
Information query