Invention Grant
- Patent Title: Method and apparatus for speech recognition using neural networks with speaker adaptation
-
Application No.: US14098259Application Date: 2013-12-05
-
Publication No.: US09721561B2Publication Date: 2017-08-01
- Inventor: Yun Tang , Venkatesh Nagesha , Xing Fan
- Applicant: Nuance Communications, Inc.
- Applicant Address: US MA Burlington
- Assignee: Nuance Communications, Inc.
- Current Assignee: Nuance Communications, Inc.
- Current Assignee Address: US MA Burlington
- Agency: Hamilton, Brook, Smith & Reynolds, P.C.
- Main IPC: G10L15/00
- IPC: G10L15/00 ; G10L15/16 ; G10L15/02 ; G10L15/06 ; G10L15/07

Abstract:
In a speech recognition system, deep neural networks (DNNs) are employed in phoneme recognition. While DNNs typically provide better phoneme recognition performance than other techniques, such as Gaussian mixture models (GMM), adapting a DNN to a particular speaker is a real challenge. According to at least one example embodiment, speech data and corresponding speaker data are both applied as input to a DNN. In response, the DNN generates a prediction of a phoneme based on the input speech data and the corresponding speaker data. The speaker data may be generated from the corresponding speech data.
Public/Granted literature
- US20150161994A1 Method and Apparatus for Speech Recognition Using Neural Networks with Speaker Adaptation Public/Granted day:2015-06-11
Information query