Speech recognition device, speech recognition method, and program
Abstract:
Provided is a speech recognition device capable of end-to-end speech recognition that takes context into account. The speech recognition device includes a model parameter learning unit and an uttered speech recognition unit. The model parameter learning unit learns a model parameter θ by maximum likelihood estimation: it uses a word sequence of concern as an observation value and uses the word sequence previous to the word sequence of concern, the acoustic feature value sequence corresponding to the word sequence of concern, and the model parameter θ as parameters, and it maximizes the likelihood function of the probability that the observation value occurs under those parameters. The uttered speech recognition unit repeats, in time-sequence order, processing that recognizes a word sequence to be recognized. This processing uses the word sequence to be recognized as an observation value and uses the already recognized word sequence previous to it, the acoustic feature value sequence corresponding to it, and the learned model parameter θ as parameters, and it is based on a maximum likelihood criterion for the likelihood function of the probability that the observation value occurs under those parameters.
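
The learning and recognition steps described above can be written compactly as follows. This is only a sketch under assumed notation that the abstract itself does not define: W^t stands for the word sequence of the t-th utterance, X^t for its acoustic feature value sequence, T for the number of utterances, and a hat marks an estimated or already recognized quantity. Summing the log-likelihood over the utterances is likewise an assumption about how the likelihood function is accumulated.

```latex
% Assumed notation (not taken from the abstract):
%   W^t : word sequence of the t-th utterance
%   X^t : acoustic feature value sequence of the t-th utterance
%   T   : number of utterances in time-sequence order
% Requires the amsmath package.
\begin{align*}
  % Learning: maximum likelihood estimation of the model parameter
  \hat{\theta} &= \operatorname*{arg\,max}_{\theta}
      \sum_{t=1}^{T} \log P\left(W^{t} \mid W^{1}, \dots, W^{t-1}, X^{t}, \theta\right) \\
  % Recognition: repeated for t = 1, 2, ..., T, conditioning on earlier results
  \hat{W}^{t} &= \operatorname*{arg\,max}_{W^{t}}
      P\left(W^{t} \mid \hat{W}^{1}, \dots, \hat{W}^{t-1}, X^{t}, \hat{\theta}\right)
\end{align*}
```

In the second line, each recognized word sequence is fed back as conditioning context for the next utterance, which is how the recognition takes context into account.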