Invention Grant
- Patent Title: Initialization of CTC speech recognition with standard HMM
- Application No.: US15645985
- Application Date: 2017-07-10
- Publication No.: US10714076B2
- Publication Date: 2020-07-14
- Inventor: Xavier Menendez-Pidal, Ruxin Chen
- Applicant: Sony Interactive Entertainment Inc.
- Applicant Address: JP Tokyo
- Assignee: Sony Interactive Entertainment Inc.
- Current Assignee: Sony Interactive Entertainment Inc.
- Current Assignee Address: JP Tokyo
- Agency: JDI Patent
- Agent: Joshua D. Isenberg; Robert Pullman
- Main IPC: G10L15/00
- IPC: G10L15/00; G10L15/16; G10L15/14; G10L15/02; G10L15/06

Abstract:
A method for improved initialization of a speech recognition system comprises mapping a trained hidden Markov model (HMM) based recognition node network to a Connectionist Temporal Classification (CTC) based node label scheme. The central state of each frame in the HMM is mapped to a CTC-labeled output node and the non-central states of each frame are mapped to CTC-blank nodes to generate a CTC-labeled HMM. Each central state represents a phoneme from human speech detected and extracted by a computing device. Next, the CTC-labeled HMM is trained using a cost function, wherein the cost function is not part of a CTC cost function. Finally, the CTC-labeled HMM is trained using a CTC cost function to produce a CTC node network. The CTC node network may be iteratively trained by repeating the initialization steps.
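The relabeling step described in the abstract can be illustrated with a minimal sketch. The abstract does not state how many states each phoneme model has, so this example assumes a three-state left-to-right HMM per phoneme with the central state at index 1. The function names, the `BLANK` symbol, and the toy alignment are hypothetical and are not taken from the patent.

```python
# Hypothetical blank symbol; the patent only refers to "CTC-blank nodes".
BLANK = "<blank>"


def hmm_alignment_to_ctc_labels(alignment, central_state=1):
    """Map a frame-level HMM state alignment to CTC-style labels.

    `alignment` is a list of (phoneme, state_index) pairs, one per frame.
    Frames in the central state keep the phoneme label; frames in any
    other state are relabeled as blank. The three-state layout and the
    central index of 1 are assumptions made for this sketch.
    """
    return [
        phoneme if state == central_state else BLANK
        for phoneme, state in alignment
    ]


def ctc_collapse(labels):
    """Standard CTC collapse: merge repeated labels, then drop blanks."""
    collapsed = []
    previous = None
    for label in labels:
        if label != previous and label != BLANK:
            collapsed.append(label)
        previous = label
    return collapsed


# Toy forced alignment for the phoneme sequence k, ae, t.
alignment = [
    ("k", 0), ("k", 1), ("k", 1), ("k", 2),
    ("ae", 0), ("ae", 1), ("ae", 2),
    ("t", 0), ("t", 1), ("t", 2),
]

ctc_labels = hmm_alignment_to_ctc_labels(alignment)
phonemes = ctc_collapse(ctc_labels)
print(ctc_labels)
print(phonemes)
```

Because every non-central state becomes a blank, consecutive phonemes are always separated by at least one blank in this layout, so collapsing the relabeled frames recovers the original phoneme sequence. Under the same assumptions, the per-frame labels could serve as targets for the first training stage, which uses a cost function that is not part of the CTC cost function, before training switches to the CTC cost function.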
Public/Granted literature
- US20190013015A1 INITIALIZATION OF CTC SPEECH RECOGNITION WITH STANDARD HMM, Public/Granted day: 2019-01-10