Speech synthesizer, and speech synthesis method and computer program product utilizing multiple-acoustic feature parameters selection

Invention Grant

US10529314B2 Speech synthesizer, and speech synthesis method and computer program product utilizing multiple-acoustic feature parameters selection 有权

Please log in to see more content

Patent Title: Speech synthesizer, and speech synthesis method and computer program product utilizing multiple-acoustic feature parameters selection
Application No.: US15434440

Application Date: 2017-02-16
Publication No.: US10529314B2

Publication Date: 2020-01-07
Inventor: Masatsune Tamura , Masahiro Morita
Applicant: Kabushiki Kaisha Toshiba
Applicant Address: JP Tokyo
Assignee: Kabushiki Kaisha Toshiba
Current Assignee: Kabushiki Kaisha Toshiba
Current Assignee Address: JP Tokyo
Agency: Knobbe, Martens, Olson & Bear, LLP
Main IPC: G10L13/027
IPC: G10L13/027 ; G10L13/047 ; G10L13/08 ; G10L13/07 ; G10L13/10 ; G06F17/27

Speech synthesizer, and speech synthesis method and computer program product utilizing multiple-acoustic feature parameters selection

Abstract:

A speech synthesizer includes a statistical-model sequence generator, a multiple-acoustic feature parameter sequence generator, and a waveform generator. The statistical-model sequence generator generates, based on context information corresponding to an input text, a statistical model sequence that comprises a first sequence of a statistical model comprising a plurality of states. The multiple-acoustic feature parameter sequence generator, for each speech section corresponding to each state of the statistical model sequence, selects a first plurality of acoustic feature parameters from a first set of acoustic feature parameters extracted from a first speech waveform stored in a speech database and generates a multiple-acoustic feature parameter sequence that comprises a sequence of the first plurality of acoustic feature parameters. The waveform generator generates a distribution sequence based on the multiple-acoustic feature parameter sequence and generates a second speech waveform based on a second set of acoustic feature parameters generated based on the distribution sequence.

Public/Granted literature

US20170162186A1 SPEECH SYNTHESIZER, AND SPEECH SYNTHESIS METHOD AND COMPUTER PROGRAM PRODUCT Public/Granted day:2017-06-08

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/02	.产生合成语音的方法；语音合成设备
G10L13/027	..概念－语音合成；从基于机器的概念产生自然词语（产生文本以外的语音合成参数的入G10L13/08）