Training apparatus for speech synthesis, speech synthesis apparatus and training method for training apparatus

Invention Grant

US10540956B2 Training apparatus for speech synthesis, speech synthesis apparatus and training method for training apparatus (Active)

Patent Title: Training apparatus for speech synthesis, speech synthesis apparatus and training method for training apparatus
Application No.: US15257247

Application Date: 2016-09-06
Publication No.: US10540956B2

Publication Date: 2020-01-21
Inventor: Yamato Ohtani, Kouichirou Mori
Applicant: Kabushiki Kaisha Toshiba
Applicant Address: JP Tokyo
Assignee: Kabushiki Kaisha Toshiba
Current Assignee: Kabushiki Kaisha Toshiba
Current Assignee Address: JP Tokyo
Agency: Knobbe, Martens, Olson & Bear, LLP
Priority: JP2015-183092 20150916
Main IPC: G10L13/04
IPC: G10L13/04

Abstract:

According to one embodiment, a training apparatus for speech synthesis includes a storage device and a hardware processor in communication with the storage device. The storage device stores an average voice model, training speaker information representing a feature of speech of a training speaker, and perception representation information represented by scores of one or more perception representations related to voice quality of the training speaker. The average voice model is constructed by utilizing acoustic data extracted from speech waveforms of a plurality of speakers and language data. The hardware processor, based at least in part on the average voice model, the training speaker information, and the perception representation score, trains one or more perception representation acoustic models corresponding to the one or more perception representations.
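The abstract names the inputs and outputs of the training step but does not say how the perception representation acoustic models are fitted. The following is only a minimal sketch under a stated assumption: each training speaker's mean acoustic vector is modeled as the average voice mean plus a score-weighted sum of one offset vector per perception representation, and the offsets are estimated by ordinary least squares. The linear model, the function names `solve_linear` and `train_perception_models`, and the flat-vector data layout are all hypothetical and are not taken from the patent.

```python
from typing import List

Matrix = List[List[float]]


def solve_linear(a: Matrix, b: Matrix) -> Matrix:
    """Solve a @ x = b for square a by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    aug = [list(a[i]) + list(b[i]) for i in range(n)]  # augmented matrix [a | b]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("scores do not determine the models (singular system)")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [rv - f * cv for rv, cv in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def train_perception_models(
    average_mean: List[float],   # mean vector of the average voice model
    speaker_means: Matrix,       # one mean vector per training speaker
    scores: Matrix,              # scores[s][k]: score of perception k for speaker s
) -> Matrix:
    """Return one offset vector per perception representation.

    ASSUMED model (not from the patent):
        speaker_means[s] ~= average_mean + sum_k scores[s][k] * models[k]
    """
    n_percepts = len(scores[0])
    dim = len(average_mean)
    # Residual of each training speaker relative to the average voice.
    resid = [[m - a for m, a in zip(sm, average_mean)] for sm in speaker_means]
    # Normal equations of least squares: (S^T S) W = S^T D.
    sts = [[sum(s[i] * s[j] for s in scores) for j in range(n_percepts)]
           for i in range(n_percepts)]
    std = [[sum(s[i] * r[d] for s, r in zip(scores, resid)) for d in range(dim)]
           for i in range(n_percepts)]
    return solve_linear(sts, std)
```

Under this assumption, a call such as `train_perception_models(avg, speaker_means, scores)` returns one row per perception representation (for example, hypothetical "bright" and "deep" voice-quality scales). At synthesis time, a target voice could then be formed by adding score-weighted rows to the average voice mean.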

Public/Granted literature

US20170076715A1 Training apparatus for speech synthesis, speech synthesis apparatus and training method for training apparatus
Public/Granted day: 2017-03-16

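IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	.Methods for producing synthetic speech; speech synthesisers
G10L13/04	..Details of speech synthesis systems, e.g. synthesiser structure or memory management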