Invention Grant
- Patent Title: Text-to-speech (TTS) processing with transfer of vocal characteristics
-
Application No.: US16430894Application Date: 2019-06-04
-
Publication No.: US11410684B1Publication Date: 2022-08-09
- Inventor: Viacheslav Klimkov , Thomas Renaud Drugman , Alexander Galkin , Srikanth Ronanki
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Pierce Atwood LLP
- Main IPC: G10L13/00
- IPC: G10L13/00 ; G10L25/78 ; G10L13/027 ; G10L15/16 ; G10L15/187 ; G06F16/38 ; G06N3/08 ; G06N20/20 ; G06F17/18 ; G06N3/04 ; G10L13/04 ; G10L13/033 ; G10L13/07

Abstract:
Audio data from a first, source speaker is received and processed to determine linguistic units and vocal characteristics corresponding to those linguistic units. The linguistic units may either be determined from received text data or may be determined from the audio data using automatic speech recognition. A model is trained using training data from a second, target speaker. The trained model concatenates the linguistic units with the vocal characteristics to produce output speech that has the “voice” of the target speaker and the vocal characteristics of the source speaker.
Information query