- Patent Title: Speech recognition device, speech recognition method, and program
- Application No.: US17428959
- Application Date: 2020-01-27
- Publication No.: US12057105B2
- Publication Date: 2024-08-06
- Inventor: Ryo Masumura, Tomohiro Tanaka, Takanobu Oba
- Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
- Applicant Address: JP Tokyo
- Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
- Current Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
- Current Assignee Address: JP Tokyo
- Priority: JP 19020396 2019.02.07
- International Application: PCT/JP2020/002648 2020.01.27
- International Announcement: WO2020/162238A 2020.08.13
- Date entered country: 2021-08-05
- Main IPC: G10L15/06
- IPC: G10L15/06; G10L15/14; G10L15/18

Abstract:
Provided is a speech recognition device capable of end-to-end speech recognition that takes context into account. The device includes a model parameter learning unit and an uttered speech recognition unit. The model parameter learning unit learns a model parameter θ by maximum likelihood estimation: it uses a word sequence of concern as the observation value, and uses a word sequence previous to the word sequence of concern, an acoustic feature value sequence corresponding to the word sequence of concern, and the model parameter θ as parameters, with the likelihood function being the probability that the observation value occurs under those parameters. The uttered speech recognition unit repeats, in order of time sequence, the processing of recognizing a word sequence to be recognized. In each repetition, it uses the word sequence to be recognized as the observation value, and uses an already recognized word sequence previous to the word sequence to be recognized, an acoustic feature value sequence corresponding to the word sequence to be recognized, and the learned model parameter θ as parameters, and recognizes the word sequence based on a maximum likelihood criterion for the same likelihood function.
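
The recognition loop described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `recognize_conversation`, `toy_candidates`, and `toy_log_likelihood` are hypothetical, the candidate lists are hard-coded, and the toy score merely stands in for the learned log-likelihood log P(W | previous word sequences, X, θ). The only point it shows is that each utterance is decoded in time order and that already recognized word sequences are fed back as context.

```python
from typing import Callable, List, Sequence

Words = List[str]
Features = Sequence[float]


def recognize_conversation(
    feature_seqs: Sequence[Features],
    candidates_for: Callable[[Features], List[Words]],
    log_likelihood: Callable[[Words, List[Words], Features], float],
) -> List[Words]:
    """Decode utterances in time order, conditioning on earlier results."""
    history: List[Words] = []  # already recognized word sequences
    for x in feature_seqs:
        # Maximum likelihood criterion: keep the highest-scoring candidate
        # given the history and the current acoustic features.
        best = max(candidates_for(x), key=lambda w: log_likelihood(w, history, x))
        history.append(best)
    return history


# --- Toy stand-ins (assumptions for illustration only) ---

def toy_candidates(x: Features) -> List[Words]:
    # The first feature value simply indexes a fixed candidate table.
    table = {
        0: [["i", "like", "tea"]],
        1: [["more", "tee"], ["more", "tea"]],
    }
    return table[int(x[0])]


def toy_log_likelihood(words: Words, history: List[Words], x: Features) -> float:
    # Stand-in for the learned model: reward words already seen in context.
    seen = {w for utt in history for w in utt}
    return float(sum(1 for w in words if w in seen))


result = recognize_conversation([[0.0], [1.0]], toy_candidates, toy_log_likelihood)
print(result)
```

In this toy run the second utterance has two acoustically confusable candidates, and the one sharing a word with the first recognized utterance is selected, which is the kind of context effect the abstract is aiming at. A real system would replace the toy scorer with a trained end-to-end model and the fixed table with a search over word sequences.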
Public/Granted literature
- US20220139374A1 SPEECH RECOGNITION DEVICE, SPEECH RECOGNITION METHOD, AND PROGRAM (Public/Granted day: 2022-05-05)