Invention Application
- Patent Title: INITIALIZATION OF CTC SPEECH RECOGNITION WITH STANDARD HMM
- Application No.: US15645985
- Application Date: 2017-07-10
- Publication No.: US20190013015A1
- Publication Date: 2019-01-10
- Inventor: Xavier Menendez-Pidal, Ruxin Chen
- Applicant: Sony Interactive Entertainment Inc.
- Main IPC: G10L15/14
- IPC: G10L15/14 ; G10L15/16 ; G10L15/02

Abstract:
A method for improved initialization of a speech recognition system comprises mapping a trained hidden Markov model (HMM) based recognition node network to a Connectionist Temporal Classification (CTC) based node label scheme. The central state of each frame in the HMM is mapped to a CTC-labeled output node and the non-central states of each frame are mapped to CTC-blank nodes to generate a CTC-labeled HMM; each central state represents a phoneme from human speech detected and extracted by a computing device. Next, the CTC-labeled HMM is trained using a cost function that is not part of a CTC cost function. Finally, the CTC-labeled HMM is trained using a CTC cost function to produce a CTC node network. The CTC node network may be iteratively trained by repeating the initialization steps.
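The relabeling step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes a frame-level alignment in which every frame carries a (phoneme, state index) pair from a left-to-right HMM with an odd number of states per phoneme, so that the "central" state is the middle index. The function name `hmm_alignment_to_ctc_labels`, the `BLANK` symbol, and the sample alignment are all hypothetical and chosen only for illustration.

```python
from typing import List, Tuple

# Hypothetical symbol standing in for the CTC blank label.
BLANK = "<blank>"


def hmm_alignment_to_ctc_labels(
    alignment: List[Tuple[str, int]],
    states_per_phoneme: int = 3,
) -> List[str]:
    """Relabel a frame-level HMM state alignment with CTC-style labels.

    Assumption: each phoneme is modeled by `states_per_phoneme` states
    indexed from 0, and the central state is the middle index.
    Frames aligned to the central state keep the phoneme label;
    frames aligned to any other state become blank.
    """
    central = states_per_phoneme // 2
    return [
        phoneme if state == central else BLANK
        for phoneme, state in alignment
    ]


# Made-up alignment for two phonemes, "k" then "ae", three states each.
alignment = [
    ("k", 0), ("k", 1), ("k", 1), ("k", 2),
    ("ae", 0), ("ae", 1), ("ae", 2),
]
labels = hmm_alignment_to_ctc_labels(alignment)
print(labels)
```

Per-frame labels of this kind could serve as targets for the intermediate training stage with a non-CTC cost function (for example a frame-wise cross-entropy, which is an assumption here since the abstract does not name the cost function) before training switches to a CTC cost function.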
Public/Granted literature
- US10714076B2 Initialization of CTC speech recognition with standard HMM, Public/Granted day: 2020-07-14