Method and apparatus for speech recognition using neural networks with speaker adaptation

Invention Grant

US09721561B2 Method and apparatus for speech recognition using neural networks with speaker adaptation (legal status: in force)

Patent Title: Method and apparatus for speech recognition using neural networks with speaker adaptation
Application No.: US14098259

Application Date: 2013-12-05
Publication No.: US09721561B2

Publication Date: 2017-08-01
Inventor: Yun Tang, Venkatesh Nagesha, Xing Fan
Applicant: Nuance Communications, Inc.
Applicant Address: US MA Burlington
Assignee: Nuance Communications, Inc.
Current Assignee: Nuance Communications, Inc.
Current Assignee Address: US MA Burlington
Agency: Hamilton, Brook, Smith & Reynolds, P.C.
Main IPC: G10L15/00
IPC: G10L15/00; G10L15/16; G10L15/02; G10L15/06; G10L15/07

Abstract:

In a speech recognition system, deep neural networks (DNNs) are employed in phoneme recognition. While DNNs typically provide better phoneme recognition performance than other techniques, such as Gaussian mixture models (GMM), adapting a DNN to a particular speaker is a real challenge. According to at least one example embodiment, speech data and corresponding speaker data are both applied as input to a DNN. In response, the DNN generates a prediction of a phoneme based on the input speech data and the corresponding speaker data. The speaker data may be generated from the corresponding speech data.
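The input arrangement described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the speaker vector here is assumed to be the per-dimension mean of the utterance's feature frames (one simple way of generating speaker data from the speech data), and the layer sizes, random weights, and all function names are hypothetical. It shows only that a speech frame and the speaker data are applied together as one input, and that the network outputs a probability for each phoneme.

```python
import math
import random


def speaker_vector(frames):
    """Assumed speaker data: per-dimension mean of the utterance's frames."""
    n = len(frames)
    return [sum(f[d] for f in frames) / n for d in range(len(frames[0]))]


def dense(x, weights, bias):
    """Affine layer: one output per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def init_layer(rng, n_out, n_in):
    """Small random weights and zero biases (untrained, for illustration)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def predict_phoneme_posteriors(frame, spk, layers):
    """Apply speech data and speaker data jointly as the network input."""
    x = list(frame) + list(spk)  # concatenated input vector
    for weights, bias in layers[:-1]:
        x = [max(0.0, v) for v in dense(x, weights, bias)]  # ReLU hidden layer
    weights, bias = layers[-1]
    return softmax(dense(x, weights, bias))


rng = random.Random(0)
FEAT_DIM, HIDDEN, N_PHONEMES = 4, 8, 3  # toy sizes, chosen arbitrarily

# Ten synthetic feature frames standing in for one speaker's utterance.
utterance = [[rng.gauss(0, 1) for _ in range(FEAT_DIM)] for _ in range(10)]
spk = speaker_vector(utterance)

layers = [
    init_layer(rng, HIDDEN, FEAT_DIM + len(spk)),
    init_layer(rng, N_PHONEMES, HIDDEN),
]
posteriors = predict_phoneme_posteriors(utterance[0], spk, layers)
print(posteriors)
```

Because the weights are untrained, the printed probabilities carry no meaning; the point is that the same speaker vector accompanies every frame of that speaker's utterance.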

Public/Granted literature

US20150161994A1 Method and Apparatus for Speech Recognition Using Neural Networks with Speaker Adaptation. Publication/grant date: 2015-06-11

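IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)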