Multistage curriculum training framework for acoustic-to-word speech recognition

Invention Grant

US11004443B2 Multistage curriculum training framework for acoustic-to-word speech recognition (in force)

Patent Title: Multistage curriculum training framework for acoustic-to-word speech recognition
Application No.: US16117373

Application Date: 2018-08-30
Publication No.: US11004443B2

Publication Date: 2021-05-11
Inventor: Chengzhu Yu, Chao Weng, Jia Cui, Dong Yu
Applicant: TENCENT AMERICA LLC
Applicant Address: US CA Palo Alto
Assignee: TENCENT AMERICA LLC
Current Assignee: TENCENT AMERICA LLC
Current Assignee Address: US CA Palo Alto
Agency: Sughrue Mion, PLLC
Main IPC: G10L15/187
IPC: G10L15/187; G10L15/06; G10L15/16

Abstract:

Methods and apparatuses are provided for acoustic-to-word (A2W) speech recognition training performed by at least one processor. The method includes initializing, by the at least one processor, one or more first layers of a neural network with phone-based Connectionist Temporal Classification (CTC); initializing, by the at least one processor, one or more second layers of the neural network with grapheme-based CTC; acquiring, by the at least one processor, training data; and performing, by the at least one processor, A2W speech recognition training based on the initialized one or more first layers and one or more second layers of the neural network using the training data.
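
The initialization order in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that "initializing with phone-based CTC" means copying weights from a network previously trained with a phone-level CTC loss, and likewise for graphemes. The names make_layers, init_a2w_layers and n_first are hypothetical, and the CTC training itself is omitted: random matrices stand in for the pre-trained networks.

```python
import copy
import random


def make_layers(n_layers, width, seed):
    """Return n_layers square weight matrices with small random values."""
    rng = random.Random(seed)
    return [
        [[rng.uniform(-0.1, 0.1) for _ in range(width)] for _ in range(width)]
        for _ in range(n_layers)
    ]


def init_a2w_layers(phone_ctc_layers, grapheme_ctc_layers, n_first):
    """Assemble the hidden layers of an A2W network (hypothetical scheme).

    Layers [0, n_first) are copied from the phone-CTC network; the
    remaining layers are copied from the grapheme-CTC network.
    """
    if len(phone_ctc_layers) != len(grapheme_ctc_layers):
        raise ValueError("both source networks must have the same depth")
    if not 0 < n_first < len(phone_ctc_layers):
        raise ValueError("n_first must leave at least one layer per group")
    first = copy.deepcopy(phone_ctc_layers[:n_first])
    second = copy.deepcopy(grapheme_ctc_layers[n_first:])
    return first + second


# Stand-ins for networks already trained with phone and grapheme CTC losses
# (assumption: real weights would come from those earlier training stages).
phone_net = make_layers(n_layers=5, width=4, seed=1)
grapheme_net = make_layers(n_layers=5, width=4, seed=2)

# Initialize the A2W network: 3 "first" layers, 2 "second" layers.
a2w_layers = init_a2w_layers(phone_net, grapheme_net, n_first=3)
# a2w_layers would next be trained on the acquired data with a word-level objective.
```

Deep copies keep the source networks unchanged, so later A2W training would not alter the phone or grapheme models.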

Published/Granted literature

US20200074983A1 MULTISTAGE CURRICULUM TRAINING FRAMEWORK FOR ACOUSTIC-TO-WORD SPEECH RECOGNITION Publication date: 2020-03-05

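IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/08	.Speech classification or search
G10L15/18	..using natural language modelling
G10L15/183	...using context dependencies, e.g. language models
G10L15/187	....Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams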