Invention Grant
- Patent Title: Exploiting sparseness in training deep neural networks
- Application No.: US13305741
- Application Date: 2011-11-28
- Publication No.: US08700552B2
- Publication Date: 2014-04-15
- Inventor: Dong Yu, Li Deng, Frank Torsten Bernd Seide, Gang Li
- Applicant: Dong Yu, Li Deng, Frank Torsten Bernd Seide, Gang Li
- Applicant Address: Redmond, WA, US
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: Redmond, WA, US
- Agent: Steve Wight; Carole Boelitz; Micky Minhas
- Main IPC: G06F15/18
- IPC: G06F15/18 ; G06N3/08

Abstract:
Deep Neural Network (DNN) training technique embodiments are presented that train a DNN while exploiting the sparseness of non-zero hidden layer interconnection weight values. Generally, a fully connected DNN is initially trained by sweeping through a full training set a number of times. Then, for the most part, only the interconnections whose weight magnitudes exceed a minimum weight threshold are considered in further training. This minimum weight threshold can be established as a value that results in only a prescribed maximum number of interconnections being considered when setting interconnection weight values via an error back-propagation procedure during the training. It is noted that the continued DNN training tends to converge much faster than the initial training.
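The following is a minimal sketch of the thresholded-sparseness idea the abstract describes: after some dense training sweeps, a minimum weight threshold is chosen so that at most a prescribed number of interconnections survive, and further back-propagation updates only those retained weights. This is written in PyTorch for illustration and is not the patent's actual implementation; names such as prune_mask and max_connections are assumptions.

```python
# Illustrative sketch only; hypothetical names, not the patented implementation.
import torch
import torch.nn as nn

def prune_mask(weight: torch.Tensor, max_connections: int) -> torch.Tensor:
    """Keep roughly the `max_connections` largest-magnitude weights.

    The minimum weight threshold is set implicitly: it is the smallest
    magnitude among the top-k retained weights, so only interconnections
    whose magnitudes meet it are considered in further training.
    """
    flat = weight.abs().flatten()
    k = min(max_connections, flat.numel())
    threshold = flat.topk(k).values.min()
    return (weight.abs() >= threshold).float()

# After the initial fully connected training sweeps, build a mask and
# continue training while zeroing out pruned connections.
layer = nn.Linear(2048, 2048)
mask = prune_mask(layer.weight.data, max_connections=100_000)

opt = torch.optim.SGD(layer.parameters(), lr=0.01)
x = torch.randn(32, 2048)
target = torch.randn(32, 2048)

loss = nn.functional.mse_loss(layer(x), target)
opt.zero_grad()
loss.backward()
layer.weight.grad.mul_(mask)   # back-propagate only through retained weights
opt.step()
layer.weight.data.mul_(mask)   # keep pruned interconnections at zero
```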
Public/Granted literature:
- US20130138589A1, EXPLOITING SPARSENESS IN TRAINING DEEP NEURAL NETWORKS, Publication date: 2013-05-30