- Patent Title: System and method for generating customized text-to-speech voices
-
Application No.: US14965251Application Date: 2015-12-10
-
Publication No.: US09721558B2Publication Date: 2017-08-01
- Inventor: Srinivas Bangalore , Junlan Feng , Mazin Gilbert , Juergen Schroeter , Ann K. Syrdal , David Schulz
- Applicant: Nuance Communications, Inc.
- Applicant Address: US MA Burlington
- Assignee: Nuance Communications, Inc.
- Current Assignee: Nuance Communications, Inc.
- Current Assignee Address: US MA Burlington
- Main IPC: G10L13/06
- IPC: G10L13/06 ; G10L13/08 ; G10L13/033 ; G10L13/02 ; G10L15/197 ; G10L13/00

Abstract:
A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.
Public/Granted literature
- US20160093287A1 SYSTEM AND METHOD FOR GENERATING CUSTOMIZED TEXT-TO-SPEECH VOICES Public/Granted day:2016-03-31
Information query