Invention Grant
US08315870B2 Rescoring speech recognition hypothesis using prosodic likelihood 有权
使用韵律可能性解答语音识别假设

  • Patent Title: Rescoring speech recognition hypothesis using prosodic likelihood
  • Patent Title (中): 使用韵律可能性解答语音识别假设
  • Application No.: US12672015
    Application Date: 2008-08-22
  • Publication No.: US08315870B2
    Publication Date: 2012-11-20
  • Inventor: Ken Hanazawa
  • Applicant: Ken Hanazawa
  • Applicant Address: JP Tokyo
  • Assignee: NEC Corporation
  • Current Assignee: NEC Corporation
  • Current Assignee Address: JP Tokyo
  • Priority: JP2007-215958 20070822
  • International Application: PCT/JP2008/065008 WO 20080822
  • International Announcement: WO2009/025356 WO 20090226
  • Main IPC: G10L15/04
  • IPC: G10L15/04
Rescoring speech recognition hypothesis using prosodic likelihood
Abstract:
A distance calculation unit (16) obtains the acoustic distance between the feature amount of input speech and each phonetic model. A word search unit (17) performs a word search based on the acoustic distance and a language model including the phoneme and prosodic label of a word, and outputs a word hypothesis and a first score representing the likelihood of the word hypothesis. The word search unit (17) also outputs a vowel interval and its tone label in the input speech, when assuming that the recognition result of the input speech is the word hypothesis. A tone recognition unit (21) outputs a second score representing the likelihood of the tone label output from the word search unit (17) based on a feature amount corresponding to the vowel interval output from the word search unit (17). A rescore unit (22) corrects the first score of the word hypothesis output from the word search unit (17) using the second score output from the tone recognition unit (21). This allows to raise the speech recognition accuracy for tone speech.
Public/Granted literature
Information query
Patent Agency Ranking
0/0