Method and system for text-to-speech synthesis

Invention Grant

US10685644B2 Method and system for text-to-speech synthesis (in force)

Patent Title: Method and system for text-to-speech synthesis
Application No.: US16027337

Application Date: 2018-07-04
Publication No.: US10685644B2

Publication Date: 2020-06-16
Inventor: Vladimir Vladimirovich Kirichenko, Petr Vladislavovich Luferenko
Applicant: YANDEX EUROPE AG
Applicant Address: CH Lucerne
Assignee: YANDEX EUROPE AG
Current Assignee: YANDEX EUROPE AG
Current Assignee Address: CH Lucerne
Agency: BCF LLP
Main IPC: G10L13/04
IPC: G10L13/04; G10L13/08; G10L13/02; G06F17/16; G10L15/187; G10L13/00; G10L13/06; G06N20/00; G06F40/205

Abstract:

There is disclosed a method of generating a text-to-speech (TTS) training set for training a Machine Learning Algorithm (MLA) for generating machine-spoken utterances. The method is executable by a server. The method includes generating a synthetic word based on merging separate phonemes from each of two words of a corpus of pre-recorded utterances, the merging being done using a common phoneme as a merging anchor, the merging resulting in at least two synthetic words. The synthetic words and assessor labels are used to train a classifier to predict a quality parameter associated with a new synthetic phonemes-based word, the quality parameter being representative of whether the new synthetic phonemes-based word is natural sounding (based on acoustic features of the generated synthetic word utterances). The classifier is then used to generate training objects for the MLA, and the MLA is used to process the corpus of pre-recorded utterances into their respective vectors.
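
The merging step in the abstract can be illustrated with a minimal sketch. It assumes each word is already available as a list of phoneme symbols and that the first shared phoneme is chosen as the anchor. The function name `merge_at_anchor`, the phoneme notation, and the anchor-selection rule are hypothetical choices for illustration, not details taken from the patent.

```python
from typing import List, Optional, Tuple

Phonemes = List[str]


def merge_at_anchor(
    word_a: Phonemes, word_b: Phonemes
) -> Optional[Tuple[Phonemes, Phonemes]]:
    """Cross two phoneme sequences at the first phoneme they share.

    Returns two synthetic words, or None when the words share no phoneme.
    Picking the *first* shared phoneme is an assumption of this sketch.
    """
    for i, phoneme in enumerate(word_a):
        if phoneme in word_b:
            j = word_b.index(phoneme)
            # Synthetic word 1: start of A, then B from the anchor onward.
            first = word_a[:i] + word_b[j:]
            # Synthetic word 2: start of B, then A from the anchor onward.
            second = word_b[:j] + word_a[i:]
            return first, second
    return None


# Hypothetical phoneme sequences for "cat" and "bad"; "ae" is the anchor.
pair = merge_at_anchor(["k", "ae", "t"], ["b", "ae", "d"])
print(pair)
```

In this sketch each pair of source words sharing a phoneme yields two synthetic words, matching the "at least two synthetic words" wording. Scoring those words for naturalness would be the job of the classifier described in the abstract and is not modeled here.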

Public/Granted literature

US20190206386A1 METHOD AND SYSTEM FOR TEXT-TO-SPEECH SYNTHESIS Public/Granted day: 2019-07-04

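IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	.Methods for producing synthetic speech; speech synthesisers
G10L13/04	..Details of speech synthesis systems, e.g. synthesiser structure or memory management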