-
1.
公开(公告)号:JPH10320170A
公开(公告)日:1998-12-04
申请号:JP35042797
申请日:1997-12-19
Applicant: KOREA ELECTRONICS TELECOMM
Inventor: LEE JUNG CHUL , HAHN MIN SOO , LEE HANG SEOP
Abstract: PROBLEM TO BE SOLVED: To improve the naturalness of a synthesized voice and to synchronize multimedia and a text/voice converter(TTS) with each other by defining information needed to interlock additional metrical information other than a text and multimedia information and an interface between those pieces of information and the TTS by the TTS, and using them for synthesized voice generation. SOLUTION: A multimedia information input part 10 consists of synchronized information of a text, metrical information, and a moving picture. A data-by- media distributor 11 separates multimedia information by media, converts it into usable data structures, and transmits them. A language processing part 12 converts them by phonemes and estimates and symbolizes the metrical information. A metrical processing part 13 calculates the values of metrical control parameters other than metrical control parameters of the multimedia information. A synchronism adjuster 14 adjusts times by duration by phonemes for synchronizing a synthesized voice to a video signal. A signal processing part 15 receives the metrical information, etc., and generates a synthesized voice by making use of a synthesis unit data base 16.
-
公开(公告)号:DE19753454C2
公开(公告)日:2003-06-18
申请号:DE19753454
申请日:1997-12-02
Applicant: KOREA ELECTRONICS TELECOMM
Inventor: LEE JUNG CHUL , HAHN MIN SOO , LEE HANG SEOP , YANG JAE WOO , LEE YOUNGIIK
Abstract: The present invention provides a text-to-speech conversion system (TTS) for interlocking synchronizing with multimedia and a method for organizing input data of the TTS which can enhance the natural naturalness of synthesized speech and accomplish the synchronization of multimedia with TTS by defining additional prosody information, the information required to interlock synchronize TTS with multimedia, and interface between these this information and TTS for use in the production of the synthesized speech.
-
公开(公告)号:DE19753453B4
公开(公告)日:2004-11-18
申请号:DE19753453
申请日:1997-12-02
Applicant: KOREA ELECTRONICS TELECOMM
Inventor: YANG JAE WOO , LEE JUNG CHUL , HAHN MIN SOO , LEE HANG SEOP , LEE YOUNGJIK
IPC: G10L13/00 , G06F17/28 , G06F17/30 , G10L13/04 , G10L13/06 , G10L13/08 , G10L21/06 , G11B20/04 , G03B31/00
Abstract: A method of formatting and normalizing continuous lip motions to events in a moving picture besides text in a Text-To-Speech converter is provided. A synthesized speech is synchronized with a moving picture by using the method wherein the real speech data and the shape of a lip in the moving picture are analyzed, and information on the estimated lip shape and text information are directly used in generating the synthesized speech.
-
公开(公告)号:DE19753454A1
公开(公告)日:1998-11-12
申请号:DE19753454
申请日:1997-12-02
Applicant: KOREA ELECTRONICS TELECOMM
Inventor: LEE JUNG CHUL , HAHN MIN SOO , LEE HANG SEOP
Abstract: The system includes a multimedia information input unit (10) for organising text, information and individual characteristic. A data distributor (11) distributes the information of the multimedia information input unit to the information for each media. A speech processor converts the text distributed by the data distributor to a phoneme stream for estimation of prosopic information and for symbolising the information. A prosopic processor (13) calculates a value of a prosopic control parameters from the symbolised prosopic information using a rule and a table. A synchronisation adjusting unit (14) adjust the duration of the phoneme using the distributed synchronisation information. A signal processor (15) generates a synthetic speech using the prosopic control parameter and data of a synthetic data base (16). A picture output unit (17)output the distributed picture information onto a screen.
-
-
-