Invention Grant
- Patent Title: Separating speech by source in audio recordings by predicting isolated audio signals conditioned on speaker representations
-
Application No.: US17170657Application Date: 2021-02-08
-
Publication No.: US11475909B2Publication Date: 2022-10-18
- Inventor: Neil Zeghidour , David Grangier
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G10L21/028
- IPC: G10L21/028 ; G10L21/0316 ; G10L17/04 ; G10L17/18 ; G06N3/04 ; G06N3/08

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech separation. One of the methods includes obtaining a recording comprising speech from a plurality of speakers; processing the recording using a speaker neural network having speaker parameter values and configured to process the recording in accordance with the speaker parameter values to generate a plurality of per-recording speaker representations, each speaker representation representing features of a respective identified speaker in the recording; and processing the per-recording speaker representations and the recording using a separation neural network having separation parameter values and configured to process the recording and the speaker representations in accordance with the separation parameter values to generate, for each speaker representation, a respective predicted isolated audio signal that corresponds to speech of one of the speakers in the recording.
Public/Granted literature
Information query