Token-wise training for attention based end-to-end speech recognition

Invention Grant

US11037547B2 Token-wise training for attention based end-to-end speech recognition (Active)

Patent Title: Token-wise training for attention based end-to-end speech recognition
Application No.: US16275971

Application Date: 2019-02-14
Publication No.: US11037547B2

Publication Date: 2021-06-15
Inventor: Peidong Wang, Jia Cui, Chao Weng, Dong Yu
Applicant: TENCENT AMERICA LLC
Applicant Address: US CA Palo Alto
Assignee: TENCENT AMERICA LLC
Current Assignee: TENCENT AMERICA LLC
Current Assignee Address: US CA Palo Alto
Agency: Sughrue Mion, PLLC
Main IPC: G06N20/00
IPC: G06N20/00; G06N7/00; G10L15/22; G10L15/06; G10L15/14

Abstract:

A method of attention-based end-to-end (A-E2E) automatic speech recognition (ASR) training includes performing cross-entropy training of a model, based on one or more input features of a speech signal; determining a posterior probability vector at a time of a first wrong token among one or more output tokens of the cross-entropy-trained model; and determining a loss of the first wrong token at that time, based on the determined posterior probability vector. The method further includes determining a total loss of a training set of the cross-entropy-trained model, based on the determined loss of the first wrong token, and updating the model based on the determined total loss of the training set.
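
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not state the form of the loss, so the sketch assumes the loss of the first wrong token is the negative log posterior of the reference token at that time step, assumes the output tokens are the per-step argmax of the posterior vectors, and assumes the total loss is a plain sum over the training set. The function names (first_wrong_index, first_wrong_token_loss, total_loss) are hypothetical, and the model update step is omitted.

```python
import math
from typing import List, Optional, Sequence, Tuple

Posteriors = Sequence[Sequence[float]]  # one probability vector per output step


def first_wrong_index(hyp: Sequence[int], ref: Sequence[int]) -> Optional[int]:
    """Return the time step of the first hypothesis token that differs
    from the reference, or None if every compared token matches.
    Only the overlapping prefix of the two sequences is compared."""
    for t, (h, r) in enumerate(zip(hyp, ref)):
        if h != r:
            return t
    return None


def first_wrong_token_loss(posteriors: Posteriors, ref: Sequence[int]) -> float:
    """Loss of the first wrong token for one utterance.

    ASSUMPTION: output tokens are the argmax of each posterior vector, and
    the loss is -log p(reference token) at the first wrong time step.
    """
    hyp = [max(range(len(p)), key=p.__getitem__) for p in posteriors]
    t = first_wrong_index(hyp, ref)
    if t is None:
        return 0.0  # no wrong token, so this utterance contributes nothing
    return -math.log(posteriors[t][ref[t]])


def total_loss(training_set: List[Tuple[Posteriors, Sequence[int]]]) -> float:
    """ASSUMPTION: total loss is the sum of per-utterance losses."""
    return sum(first_wrong_token_loss(p, r) for p, r in training_set)


# Toy example: three output steps over a three-token vocabulary.
example_posteriors = [
    [0.7, 0.2, 0.1],
    [0.1, 0.6, 0.3],  # argmax is token 1, but the reference is token 2
    [0.2, 0.3, 0.5],
]
example_reference = [0, 2, 2]
example_loss = first_wrong_token_loss(example_posteriors, example_reference)
```

In the toy example the first wrong token occurs at the second output step, so only the posterior vector at that step contributes to the loss. In an actual training setup the total loss would be differentiated with respect to the model parameters to perform the update described in the abstract.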

Public/Granted literature

US20200265830A1 TOKEN-WISE TRAINING FOR ATTENTION BASED END-TO-END SPEECH RECOGNITION, Public/Granted day: 2020-08-20

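IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computing arrangements based on specific computational models
G06N20/00	Machine learning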