Invention Grant
- Patent Title: Centered, left- and right-shifted deep neural networks and their combinations
-
Application No.: US15707562Application Date: 2017-09-18
-
Publication No.: US10255910B2Publication Date: 2019-04-09
- Inventor: Mudar Yaghi , Hassan Sawaf , Jinato Jiang
- Applicant: Apptek, Inc.
- Applicant Address: US VA McLean
- Assignee: AppTek, Inc.
- Current Assignee: AppTek, Inc.
- Current Assignee Address: US VA McLean
- Agency: Morgan, Lewis & Bockius LLP
- Agent Robert C. Bertin; Rachael Lea Leventhal
- Main IPC: G10L15/16
- IPC: G10L15/16 ; G06N3/08 ; G06N3/04

Abstract:
Deep Neural Networks (DNN) are time shifted relative to one another and trained. The time-shifted networks may then be combined to improve recognition accuracy. The approach is based on an automatic speech recognition (ASR) system using DNN and using time shifted features. Initially, a regular ASR model is trained to produce a first trained DNN. Then a top layer (e.g., SoftMax layer) and the last hidden layer (e.g., Sigmoid) are fine-tuned with same data set but with a feature window left- and right-shifted to create respective second and third left-shifted and right-shifted DNNs. From these three DNN networks, four combination networks may be generated: left- and right-shifted, left-shifted and centered, centered and right-shifted, and left-shifted, centered, and right-shifted. The centered networks are used to perform the initial (first-pass) ASR. Then the other six networks are used to perform rescoring. The resulting are combined using ROVER (recognizer output voting error reduction) or another technique to improve recognition performance as compared to the centered DNN by itself.
Public/Granted literature
- US20180082677A1 CENTERED, LEFT- AND RIGHT-SHIFTED DEEP NEURAL NETWORKS AND THEIR COMBINATIONS Public/Granted day:2018-03-22
Information query