Invention Grant
US07991615B2 Grapheme-to-phoneme conversion using acoustic data 有权
使用声学数据的语音对音素转换

Grapheme-to-phoneme conversion using acoustic data
Abstract:
Described is the use of acoustic data to improve grapheme-to-phoneme conversion for speech recognition, such as to more accurately recognize spoken names in a voice-dialing system. A joint model of acoustics and graphonemes (acoustic data, phonemes sequences, grapheme sequences and an alignment between phoneme sequences and grapheme sequences) is described, as is retraining by maximum likelihood training and discriminative training in adapting graphoneme model parameters using acoustic data. Also described is the unsupervised collection of grapheme labels for received acoustic data, thereby automatically obtaining a substantial number of actual samples that may be used in retraining. Speech input that does not meet a confidence threshold may be filtered out so as to not be used by the retrained model.
Public/Granted literature
Information query
Patent Agency Ranking
0/0