Invention Grant
- Patent Title: Phoneme-based contextualization for cross-lingual speech recognition in end-to-end models
-
Application No.: US16861190Application Date: 2020-04-28
-
Publication No.: US11270687B2Publication Date: 2022-03-08
- Inventor: Ke Hu , Antoine Jean Bruguier , Tara N. Sainath , Rohit Prakash Prabhavalkar , Golan Pundak
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Honigman LLP
- Agent Brett A. Krueger; Grant Griffith
- Main IPC: G10L15/30
- IPC: G10L15/30 ; G10L15/06 ; G10L15/02 ; G10L15/187 ; G10L15/193 ; G10L15/28 ; G10L15/32 ; G10L25/30

Abstract:
A method includes receiving audio data encoding an utterance spoken by a native speaker of a first language, and receiving a biasing term list including one or more terms in a second language different than the first language. The method also includes processing, using a speech recognition model, acoustic features derived from the audio data to generate speech recognition scores for both wordpieces and corresponding phoneme sequences in the first language. The method also includes rescoring the speech recognition scores for the phoneme sequences based on the one or more terms in the biasing term list, and executing, using the speech recognition scores for the wordpieces and the rescored speech recognition scores for the phoneme sequences, a decoding graph to generate a transcription for the utterance.
Public/Granted literature
- US20200349923A1 PHONEME-BASED CONTEXTUALIZATION FOR CROSS-LINGUAL SPEECH RECOGNITION IN END-TO-END MODELS Public/Granted day:2020-11-05
Information query