Invention Grant
- Patent Title: Injecting text in self-supervised speech pre-training
- Application No.: US17808091
- Application Date: 2022-06-21
- Publication No.: US12159617B2
- Publication Date: 2024-12-03
- Inventor: Zhehuai Chen, Bhuvana Ramabhadran, Andrew M. Rosenberg, Yu Zhang, Pedro J. Moreno Mengibar
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Honigman LLP
- Agent: Brett A. Krueger; Grant Griffith
- Main IPC: G10L15/06
- IPC: G10L15/06; G10L13/047; G10L13/08; G10L15/16

Abstract:
A method includes receiving training data that includes unspoken text utterances and un-transcribed non-synthetic speech utterances. Each unspoken text utterance is not paired with any corresponding spoken utterance of non-synthetic speech. Each un-transcribed non-synthetic speech utterance is not paired with a corresponding transcription. The method also includes generating a corresponding synthetic speech representation for each unspoken textual utterance of the received training data using a text-to-speech model. The method also includes pre-training an audio encoder on the synthetic speech representations generated for the unspoken textual utterances and the un-transcribed non-synthetic speech utterances to teach the audio encoder to jointly learn shared speech and text representations.
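The data flow described in the abstract can be sketched as follows. This is only an illustrative outline, not the patented implementation: the names `PretrainExample`, `build_pretraining_set`, and `toy_tts` are hypothetical, the text-to-speech model is replaced by a placeholder function, and the pre-training objective itself is not shown because the abstract does not specify it.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

# Hypothetical feature type: a list of frames, each frame a list of floats.
Features = List[List[float]]


@dataclass
class PretrainExample:
    features: Features
    source: str  # "synthetic" (from TTS) or "real" (non-synthetic speech)


def build_pretraining_set(
    unspoken_texts: Sequence[str],
    untranscribed_speech: Sequence[Features],
    tts: Callable[[str], Features],
) -> List[PretrainExample]:
    """Combine TTS output for text-only data with unlabeled real speech."""
    # Each unspoken text utterance gets a synthetic speech representation.
    synthetic = [PretrainExample(tts(t), "synthetic") for t in unspoken_texts]
    # Un-transcribed speech is used as-is; no transcription is needed.
    real = [PretrainExample(f, "real") for f in untranscribed_speech]
    return synthetic + real


def toy_tts(text: str) -> Features:
    # Placeholder (assumption): one 1-dimensional frame per character.
    # A real text-to-speech model would emit acoustic frames instead.
    return [[float(ord(c))] for c in text]


examples = build_pretraining_set(["hi"], [[[0.1], [0.2]]], toy_tts)
```

The resulting list mixes both sources, so a single audio encoder could be pre-trained on it without paired speech and text.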
Public/Granted literature
- US20230017892A1 Injecting Text in Self-Supervised Speech Pre-training (Public/Granted day: 2023-01-19)