Speech recognition device, speech recognition method, and program

Invention Grant

US12057105B2 Speech recognition device, speech recognition method, and program (In force)

Patent Title: Speech recognition device, speech recognition method, and program
Application No.: US17428959

Application Date: 2020-01-27
Publication No.: US12057105B2

Publication Date: 2024-08-06
Inventor: Ryo Masumura, Tomohiro Tanaka, Takanobu Oba
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Applicant Address: JP Tokyo
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Current Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Current Assignee Address: JP Tokyo
Priority: JP 19020396 2019-02-07
International Application: PCT/JP2020/002648 2020-01-27
International Publication: WO2020/162238A 2020-08-13
National phase entry date: 2021-08-05
Main IPC: G10L15/06
IPC: G10L15/06; G10L15/14; G10L15/18

Abstract:

Provided is a speech recognition device capable of end-to-end speech recognition that takes context into account. The device includes a model parameter learning unit and an uttered speech recognition unit. The model parameter learning unit learns a model parameter θ by maximum likelihood estimation: it treats a word sequence of concern as the observation value, treats the word sequence previous to the word sequence of concern, the acoustic feature value sequence corresponding to the word sequence of concern, and the model parameter θ as parameters, and maximizes the likelihood function of the probability that the observation value occurs under those parameters. The uttered speech recognition unit repeats, in time-sequence order, processing that recognizes a word sequence to be recognized. This processing treats the word sequence to be recognized as the observation value, treats the already recognized word sequence previous to it, the acoustic feature value sequence corresponding to it, and the learned model parameter θ as parameters, and is based on a maximum likelihood criterion for the likelihood function of the probability that the observation value occurs under those parameters.
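
The two units described in the abstract can be written compactly as a pair of optimization problems. The notation below is an assumption introduced here for illustration and does not appear in this record: W^t denotes the t-th word sequence in time order, X^t its acoustic feature value sequence, T the number of word sequences, and a hat marks an estimated or recognized quantity. The sketch also assumes that the "previous" word sequence covers all earlier word sequences and that the likelihood function factorizes over them; the abstract does not state either point.

```latex
% Learning unit: maximum likelihood estimation of the model parameter
\hat{\theta} = \arg\max_{\theta} \sum_{t=1}^{T}
  \log P\left(W^{t} \mid W^{1}, \dots, W^{t-1}, X^{t}, \theta\right)

% Recognition unit: repeated for t = 1, 2, \dots in time-sequence order,
% conditioning on the word sequences already recognized
\hat{W}^{t} = \arg\max_{W^{t}}
  P\left(W^{t} \mid \hat{W}^{1}, \dots, \hat{W}^{t-1}, X^{t}, \hat{\theta}\right)
```

In this reading, the context dependence comes entirely from the conditioning on earlier word sequences: ground-truth ones during learning, and previously recognized ones during recognition.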

Public/Granted literature

US20220139374A1 SPEECH RECOGNITION DEVICE, SPEECH RECOGNITION METHOD, AND PROGRAM Public/Granted day: 2022-05-05

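IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/06	.Creation of reference templates; training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence)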