Invention Grant
- Patent Title: Identifying keyword occurrences in audio data
- Patent Title (中): 识别音频数据中的关键字出现
-
Application No.: US12686892Application Date: 2010-01-13
-
Publication No.: US08423363B2Publication Date: 2013-04-16
- Inventor: Vishwa Nath Gupta , Gilles Boulianne
- Applicant: Vishwa Nath Gupta , Gilles Boulianne
- Applicant Address: CA Montréal, Québec
- Assignee: CRIM (Centre de Recherche Informatique de Montréal)
- Current Assignee: CRIM (Centre de Recherche Informatique de Montréal)
- Current Assignee Address: CA Montréal, Québec
- Agency: Wilmer Cutler Pickering Hale and Dorr LLP
- Main IPC: G10L15/28
- IPC: G10L15/28 ; G10L15/00 ; G10L15/18

Abstract:
Occurrences of one or more keywords in audio data are identified using a speech recognizer employing a language model to derive a transcript of the keywords. The transcript is converted into a phoneme sequence. The phonemes of the phoneme sequence are mapped to the audio data to derive a time-aligned phoneme sequence that is searched for occurrences of keyword phoneme sequences corresponding to the phonemes of the keywords. Searching includes computing a confusion matrix. The language model used by the speech recognizer is adapted to keywords by increasing the likelihoods of the keywords in the language model. For each potential occurrences keywords detected, a corresponding subset of the audio data may be played back to an operator to confirm whether the potential occurrences correspond to actual occurrences of the keywords.
Public/Granted literature
- US20100179811A1 IDENTIFYING KEYWORD OCCURRENCES IN AUDIO DATA Public/Granted day:2010-07-15
Information query