Invention Grant
- Patent Title: Speaker-adaptive synthesized voice
- Application No.: US13319856
- Application Date: 2010-03-16
- Publication No.: US08744853B2
- Publication Date: 2014-06-03
- Inventor: Masafumi Nishimura, Ryuki Tachibana
- Applicant: Masafumi Nishimura, Ryuki Tachibana
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Fleit Gibbons Gutman Bongini & Bianco PL
- Agent: Jon A. Gibbons
- Priority: JP2009-129366 20090528
- International Application: PCT/JP2010/054413 WO 20100316
- International Announcement: WO2010/137385 WO 20101202
- Main IPC: G10L13/00
- IPC: G10L13/00; G10L15/00; G10L15/06

Abstract:
An objective is to provide a technique for accurately reproducing the features of the fundamental frequency (F0) of a target speaker's voice from only a small amount of learning data. A learning apparatus learns shift amounts from a reference source F0 pattern to a target F0 pattern of the target speaker's voice. The learning apparatus associates a source F0 pattern of a learning text with a target F0 pattern of the same learning text by associating their peaks and troughs. For each point on the target F0 pattern, the learning apparatus obtains shift amounts in the time-axis direction and in the frequency-axis direction from the corresponding point on the source F0 pattern by referring to the result of the association. It then learns a decision tree using, as the input feature vector, linguistic information obtained by parsing the learning text and, as the output feature vector, the calculated shift amounts.
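The shift-amount step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and it rests on several assumptions that are not in the source: an F0 pattern is represented as a list of (time in ms, F0 in Hz) points, peaks and troughs are strict local extrema, the i-th source extremum is simply paired with the i-th target extremum, and the frequency shift is a plain difference in Hz. The function names find_extrema and shift_amounts are hypothetical. The sketch computes shifts only at the paired extrema, and the decision-tree learning step is left out.

```python
from typing import List, Tuple

# Assumed representation: (time in milliseconds, F0 in Hz).
Point = Tuple[int, float]


def find_extrema(pattern: List[Point]) -> List[Point]:
    """Return interior points that are strict local peaks or troughs of F0."""
    extrema = []
    for prev, cur, nxt in zip(pattern, pattern[1:], pattern[2:]):
        is_peak = cur[1] > prev[1] and cur[1] > nxt[1]
        is_trough = cur[1] < prev[1] and cur[1] < nxt[1]
        if is_peak or is_trough:
            extrema.append(cur)
    return extrema


def shift_amounts(source: List[Point], target: List[Point]) -> List[Tuple[int, float]]:
    """Pair extrema in order and return (time shift, frequency shift) per pair.

    In-order pairing is an illustrative assumption; it requires both
    patterns to have the same number of extrema.
    """
    src = find_extrema(source)
    tgt = find_extrema(target)
    if len(src) != len(tgt):
        raise ValueError("patterns have different numbers of extrema")
    # Shift = target value minus source value, on each axis.
    return [(t[0] - s[0], t[1] - s[1]) for s, t in zip(src, tgt)]


# Made-up F0 patterns for the same learning text.
source = [(0, 100.0), (100, 120.0), (200, 90.0), (300, 110.0), (400, 100.0)]
target = [(0, 150.0), (100, 160.0), (200, 190.0), (300, 140.0),
          (400, 170.0), (500, 150.0)]

for dt, df in shift_amounts(source, target):
    print(f"time shift: {dt} ms, frequency shift: {df} Hz")
```

In the scheme the abstract describes, such (time shift, frequency shift) pairs would serve as the output feature vectors of the decision tree, with linguistic information from the parsed learning text as the input.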
Public/Granted literature
- US20120059654A1 SPEAKER-ADAPTIVE SYNTHESIZED VOICE Public/Granted day: 2012-03-08