Methods, systems, and circuits for speaker dependent voice recognition with a single lexicon

Invention Grant

US09633652B2 Methods, systems, and circuits for speaker dependent voice recognition with a single lexicon (In force)

Patent Title: Methods, systems, and circuits for speaker dependent voice recognition with a single lexicon
Application No.: US13854133

Application Date: 2013-03-31
Publication No.: US09633652B2

Publication Date: 2017-04-25
Inventor: Evelyn Kurniawati, Sapna George
Applicant: STMicroelectronics Asia Pacific Pte Ltd.
Applicant Address: SG Singapore
Assignee: STMicroelectronics Asia Pacific Pte Ltd.
Current Assignee: STMicroelectronics Asia Pacific Pte Ltd.
Current Assignee Address: SG Singapore
Agency: Seed IP Law Group LLP
Main IPC: G10L15/07
IPC: G10L15/07; G10L15/14; G10L17/04; G10L15/00; G10L15/06

Abstract:

Embodiments reduce the complexity of speaker dependent speech recognition systems and methods by representing the code phrase (i.e., the word or words to be recognized) using a single Gaussian Mixture Model (GMM) that is adapted from a Universal Background Model (UBM). Only the parameters of the GMM need to be stored. Computation is reduced further by checking only the GMM component that is relevant to the keyword template. In this scheme, the keyword template is represented by a sequence of indices of the best-performing component of the keyword model's GMM. Only one template is saved, obtained by combining the registration templates using the Longest Common Sequence algorithm. The quality of the word model is continuously updated by performing an expectation maximization iteration using each test word that is accepted as matching the keyword model.
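
The template idea in the abstract can be illustrated with a minimal sketch. It assumes a diagonal-covariance GMM and reads "Longest Common Sequence" as the standard longest common subsequence dynamic program; both are assumptions, since the abstract does not give these details. The function names (`best_component_sequence`, `longest_common_subsequence`) and the toy parameters are hypothetical and do not come from the patent.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def best_component_sequence(frames, weights, means, variances):
    """Map each frame to the index of its highest-scoring GMM component."""
    seq = []
    for x in frames:
        scores = [
            math.log(w) + log_gauss_diag(x, m, v)
            for w, m, v in zip(weights, means, variances)
        ]
        seq.append(max(range(len(scores)), key=scores.__getitem__))
    return seq

def longest_common_subsequence(a, b):
    """Return one longest common subsequence of index sequences a and b."""
    n, m = len(a), len(b)
    # dp[i][j] = LCS length of a[i:] and b[j:]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            if a[i] == b[j]:
                dp[i][j] = 1 + dp[i + 1][j + 1]
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])
    # Walk the table forward to recover the subsequence itself.
    out, i, j = [], 0, 0
    while i < n and j < m:
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return out

# Toy 1-D GMM with two components (illustrative values only).
weights = [0.5, 0.5]
means = [[0.0], [5.0]]
variances = [[1.0], [1.0]]

# Two hypothetical registration utterances of the same keyword.
template_a = best_component_sequence(
    [[0.1], [4.8], [5.2], [-0.3]], weights, means, variances
)
template_b = best_component_sequence(
    [[0.2], [-0.1], [5.1], [0.4]], weights, means, variances
)

# Combine the registration templates into the single stored template.
merged_template = longest_common_subsequence(template_a, template_b)
```

Storing only `merged_template` alongside the GMM parameters mirrors the abstract's point that a single template is kept; how the patent handles repeated indices or scoring against the template is not stated in the abstract and is left out here.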

Public/Granted literature

US20140200890A1 Methods, systems, and circuits for speaker dependent voice recognition with a single lexicon Public/Granted date: 2014-07-17

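IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/06	.Creation of reference templates; training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence)
G10L15/065	..Adaptation
G10L15/07	...to the speaker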