Multi-channel speech recognition

Invention Grant

US10199035B2 Multi-channel speech recognition (In force)

Patent Title: Multi-channel speech recognition
Application No.: US14087885

Application Date: 2013-11-22
Publication No.: US10199035B2

Publication Date: 2019-02-05
Inventor: Ilya Dan Melamed, Andrej Ljolje
Applicant: Nuance Communications, Inc.
Applicant Address: US MA Burlington
Assignee: NUANCE COMMUNICATIONS, INC.
Current Assignee: NUANCE COMMUNICATIONS, INC.
Current Assignee Address: US MA Burlington
Main IPC: G10L15/07
IPC: G10L15/07; G10L15/20; G10L15/22; G10L15/28

Abstract:

Systems, methods, and computer-readable storage devices for performing per-channel automatic speech recognition. An example system configured to practice the method combines a first audio signal of a first speaker in a communication session and a second audio signal from a second speaker in the communication session as a first audio channel and a second audio channel. The system can recognize speech in the first audio channel of the recording using a first model specific to the first speaker, and recognize speech in the second audio channel of the recording using a second model specific to the second speaker, wherein the first model is different from the second model. The system can generate recognized speech as an output from the communication session. The system can identify the models based on identifiers of the speakers, such as a telephone number, an IP address, a customer number, or account number.
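The flow described in the abstract can be sketched as follows. This is only a minimal illustration, not the patented implementation: the function names (`combine_channels`, `split_channels`, `recognize_session`), the zero-padding of the shorter signal, and the representation of a speaker-specific model as a plain callable are all assumptions made for the example. The speaker identifiers stand in for a telephone number, IP address, customer number, or account number.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Samples = Sequence[float]
# Hypothetical stand-in for a speaker-specific model: audio samples in, text out.
Recognizer = Callable[[Samples], str]


def combine_channels(first: Samples, second: Samples) -> List[Tuple[float, float]]:
    """Combine two mono signals into one two-channel recording.

    Assumption: the shorter signal is zero-padded so both channels align.
    """
    n = max(len(first), len(second))
    ch1 = list(first) + [0.0] * (n - len(first))
    ch2 = list(second) + [0.0] * (n - len(second))
    return list(zip(ch1, ch2))


def split_channels(recording: Sequence[Tuple[float, float]]) -> Tuple[List[float], List[float]]:
    """Recover the per-speaker channels from the two-channel recording."""
    return [frame[0] for frame in recording], [frame[1] for frame in recording]


def recognize_session(
    recording: Sequence[Tuple[float, float]],
    first_speaker_id: str,
    second_speaker_id: str,
    models: Dict[str, Recognizer],
) -> Dict[str, str]:
    """Recognize each channel with the model identified by its speaker's ID."""
    first_model = models[first_speaker_id]    # raises KeyError if no model is registered
    second_model = models[second_speaker_id]
    if first_model is second_model:
        # The abstract requires the first model to differ from the second.
        raise ValueError("each channel needs its own speaker-specific model")
    ch1, ch2 = split_channels(recording)
    return {
        first_speaker_id: first_model(ch1),
        second_speaker_id: second_model(ch2),
    }
```

In use, `models` would map each identifier to a real recognizer. Keeping each speaker on a separate channel means neither model has to process the other speaker's audio.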

Public/Granted literature

US20150149162A1 MULTI-CHANNEL SPEECH RECOGNITION Public/Granted day: 2015-05-28

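IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/06	.Creation of reference templates; training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence)
G10L15/065	..Adaptation
G10L15/07	...To the speaker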