System and Method for Learning Alternate Pronunciations for Speech Recognition
    5.
    发明申请
    System and Method for Learning Alternate Pronunciations for Speech Recognition 有权
    学习用于语音识别的替代发音的系统和方法

    公开(公告)号:US20150106082A1

    公开(公告)日:2015-04-16

    申请号:US14515607

    申请日:2014-10-16

    Abstract: A system and method for learning alternate pronunciations for speech recognition is disclosed. Alternative name pronunciations may be covered, through pronunciation learning, that have not been previously covered in a general pronunciation dictionary. In an embodiment, the detection of phone-level and syllable-level mispronunciations in words and sentences may be based on acoustic models trained by Hidden Markov Models. Mispronunciations may be detected by comparing the likelihood of the potential state of the targeting pronunciation unit with a pre-determined threshold through a series of tests. It is also within the scope of an embodiment to detect accents.

    Abstract translation: 公开了用于学习语音识别的替代发音的系统和方法。 通过发音学习可以覆盖另类名称发音,这些发音先前未被一般发音词典涵盖。 在一个实施例中,在单词和句子中检测电话级和音节级错误可以基于由隐马尔可夫模型训练的声学模型。 可以通过一系列测试来比较目标语音单元的潜在状态与预定阈值的可能性来检测微分。 检测重音也属于实施例的范围。

    System and method for speaker change detection

    公开(公告)号:US10535000B2

    公开(公告)日:2020-01-14

    申请号:US15727498

    申请日:2017-10-06

    Abstract: A method for training a neural network of a neural network based speaker classifier for use in speaker change detection. The method comprises: a) preprocessing input speech data; b) extracting a plurality of feature frames from the preprocessed input speech data; c) normalizing the extracted feature frames of each speaker within the preprocessed input speech data with each speaker's mean and variance; d) concatenating the normalized feature frames to form overlapped longer frames having a frame length and a hop size; e) inputting the overlapped longer frames to the neural network based speaker classifier; and f) training the neural network through forward-backward propagation.

Patent Agency Ranking