Invention Grant
- Patent Title: Systems and methods for combining stochastic average gradient and hessian-free optimization for sequence training of deep neural networks
-
Application No.: US14793095Application Date: 2015-07-07
-
Publication No.: US09626621B2Publication Date: 2017-04-18
- Inventor: Pierre Dognin , Vaibhava Goel
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Ryan, Mason & Lewis, LLP
- Agent William Stock
- Main IPC: G10L15/16
- IPC: G10L15/16 ; G06N3/08 ; G10L15/06 ; G06N3/04

Abstract:
A method for training a deep neural network (DNN), comprises receiving and formatting speech data for the training, performing Hessian-free sequence training (HFST) on a first subset of a plurality of subsets of the speech data, and iteratively performing the HFST on successive subsets of the plurality of subsets of the speech data, wherein iteratively performing the HFST comprises reusing information from at least one previous iteration.
Public/Granted literature
Information query