Invention Grant
- Patent Title: Multilingual re-scoring models for automatic speech recognition
- Application No.: US18589220
- Application Date: 2024-02-27
- Publication No.: US12254875B2
- Publication Date: 2025-03-18
- Inventor: Neeraj Gaur, Tongzhou Chen, Ehsan Variani, Bhuvana Ramabhadran, Parisa Haghani, Pedro J. Moreno Mengibar
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Honigman LLP
- Agent: Brett A. Krueger; Grant Griffith
- Main IPC: G10L15/197
- IPC: G10L15/197; G10L15/00; G10L15/16; G10L15/22

Abstract:
A method includes receiving a sequence of acoustic frames extracted from audio data corresponding to an utterance. During a first pass, the method includes processing the sequence of acoustic frames to generate N candidate hypotheses for the utterance. During a second pass, and for each candidate hypothesis, the method includes: generating a respective un-normalized likelihood score; generating a respective external language model score; generating a standalone score that models prior statistics of the corresponding candidate hypothesis; and generating a respective overall score for the candidate hypothesis based on the un-normalized likelihood score, the external language model score, and the standalone score. The method also includes selecting the candidate hypothesis having the highest respective overall score from among the N candidate hypotheses as a final transcription of the utterance.
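The second-pass selection described in the abstract can be illustrated with a minimal sketch. The abstract says only that the overall score is "based on" the three component scores, so the weighted sum below, the weight parameters `lm_weight` and `standalone_weight`, and the names `Candidate`, `overall_score`, and `select_transcription` are all assumptions made for illustration, not the claimed method. The numeric scores in any usage are made up.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Candidate:
    """One of the N first-pass hypotheses with its second-pass scores."""
    text: str
    likelihood_score: float   # un-normalized likelihood score
    lm_score: float           # external language model score
    standalone_score: float   # score modeling prior statistics of the hypothesis


def overall_score(c: Candidate,
                  lm_weight: float = 0.5,
                  standalone_weight: float = 0.5) -> float:
    # ASSUMPTION: a weighted sum of log-domain scores. The abstract does not
    # specify how the three scores are combined, nor any weights or signs.
    return (c.likelihood_score
            + lm_weight * c.lm_score
            + standalone_weight * c.standalone_score)


def select_transcription(candidates: List[Candidate],
                         lm_weight: float = 0.5,
                         standalone_weight: float = 0.5) -> str:
    """Return the text of the candidate with the highest overall score."""
    if not candidates:
        raise ValueError("at least one candidate hypothesis is required")
    best = max(candidates,
               key=lambda c: overall_score(c, lm_weight, standalone_weight))
    return best.text
```

With both weights set to zero, the selection falls back to ranking by the un-normalized likelihood score alone, which makes it easy to see how much the external language model and standalone scores change the outcome.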
Public/Granted literature
- US20240203409A1 Multilingual Re-Scoring Models for Automatic Speech Recognition (Public/Granted day: 2024-06-20)