Speech synthesizer, and speech synthesis method and computer program product utilizing multiple-acoustic feature parameters selection
Abstract:
A speech synthesizer includes a statistical-model sequence generator, a multiple-acoustic feature parameter sequence generator, and a waveform generator. The statistical-model sequence generator generates, based on context information corresponding to an input text, a statistical model sequence that comprises a first sequence of a statistical model comprising a plurality of states. The multiple-acoustic feature parameter sequence generator, for each speech section corresponding to each state of the statistical model sequence, selects a first plurality of acoustic feature parameters from a first set of acoustic feature parameters extracted from a first speech waveform stored in a speech database and generates a multiple-acoustic feature parameter sequence that comprises a sequence of the first plurality of acoustic feature parameters. The waveform generator generates a distribution sequence based on the multiple-acoustic feature parameter sequence and generates a second speech waveform based on a second set of acoustic feature parameters generated based on the distribution sequence.
Information query
Patent Agency Ranking
0/0