Invention Grant
- Patent Title: Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks
-
Application No.: US14961370Application Date: 2015-12-07
-
Publication No.: US09697820B2Publication Date: 2017-07-04
- Inventor: Woojay Jeon
- Applicant: Apple Inc.
- Applicant Address: US CA Cupertino
- Assignee: Apple Inc.
- Current Assignee: Apple Inc.
- Current Assignee Address: US CA Cupertino
- Agency: Morrison & Foerster LLP
- Main IPC: G10L13/08
- IPC: G10L13/08 ; G10L13/07 ; G10L13/047

Abstract:
Systems and processes for performing unit-selection text-to-speech synthesis are provided. In one example process, a sequence of target units can represent a spoken pronunciation of text. A set of predicted acoustic model parameters of a second target unit can be determined using a set of acoustic features of a first candidate speech segment of a first target unit and a set of linguistic features of the second target unit. A likelihood score of the second candidate speech segment with respect to the first candidate speech segment can be determined using the set of predicted acoustic model parameters of the second target unit and a set of acoustic features of the second candidate speech segment of the second target unit. The second candidate speech segment can be selected for speech synthesis based on the determined likelihood score. Speech corresponding to the received text can be generated using the selected second candidate speech segment.
Public/Granted literature
- US20170092259A1 UNIT-SELECTION TEXT-TO-SPEECH SYNTHESIS USING CONCATENATION-SENSITIVE NEURAL NETWORKS Public/Granted day:2017-03-30
Information query