Invention Grant
- Patent Title: Rescoring speech recognition hypothesis using prosodic likelihood
- Patent Title (中): 使用韵律可能性解答语音识别假设
-
Application No.: US12672015Application Date: 2008-08-22
-
Publication No.: US08315870B2Publication Date: 2012-11-20
- Inventor: Ken Hanazawa
- Applicant: Ken Hanazawa
- Applicant Address: JP Tokyo
- Assignee: NEC Corporation
- Current Assignee: NEC Corporation
- Current Assignee Address: JP Tokyo
- Priority: JP2007-215958 20070822
- International Application: PCT/JP2008/065008 WO 20080822
- International Announcement: WO2009/025356 WO 20090226
- Main IPC: G10L15/04
- IPC: G10L15/04

Abstract:
A distance calculation unit (16) obtains the acoustic distance between the feature amount of input speech and each phonetic model. A word search unit (17) performs a word search based on the acoustic distance and a language model including the phoneme and prosodic label of a word, and outputs a word hypothesis and a first score representing the likelihood of the word hypothesis. The word search unit (17) also outputs a vowel interval and its tone label in the input speech, when assuming that the recognition result of the input speech is the word hypothesis. A tone recognition unit (21) outputs a second score representing the likelihood of the tone label output from the word search unit (17) based on a feature amount corresponding to the vowel interval output from the word search unit (17). A rescore unit (22) corrects the first score of the word hypothesis output from the word search unit (17) using the second score output from the tone recognition unit (21). This allows to raise the speech recognition accuracy for tone speech.
Public/Granted literature
- US20110196678A1 SPEECH RECOGNITION APPARATUS AND SPEECH RECOGNITION METHOD Public/Granted day:2011-08-11
Information query