-
公开(公告)号:DE69727046T2
公开(公告)日:2004-06-09
申请号:DE69727046
申请日:1997-10-15
Applicant: MOTOROLA INC
Inventor: CORRIGAN GERALD , KARAALI ORHAN , MASSEY NOEL
IPC: G10L13/08
Abstract: The present invention teaches a method (400), device and system (300) utilizing at least one of: mapping a sequence of phones to a sequence of articulatory features and utilizing prominence and boundary information, in addition to a predetermined set of rules for type, phonetic context, syntactic and prosodic context for phones to provide provide a system that generates segment durations efficiently with a small training set.
-
公开(公告)号:GB2346788B
公开(公告)日:2001-02-14
申请号:GB0009708
申请日:1998-07-20
Applicant: MOTOROLA INC
Inventor: MASSEY NOEL , KARAALI ORHAN , SCHNURR OTTO
-
23.
公开(公告)号:GB2346526B
公开(公告)日:2001-02-14
申请号:GB0009689
申请日:1998-07-20
Applicant: MOTOROLA INC
Inventor: MASSEY NOEL , KARAALI ORHAN , SCHNURR OTTO
-
24.
公开(公告)号:GB2346525B
公开(公告)日:2001-02-14
申请号:GB0009687
申请日:1998-07-20
Applicant: MOTOROLA INC
Inventor: MASSEY NOEL , KARAALI ORHAN , SCHNURR OTTO
-
公开(公告)号:DE19825205C2
公开(公告)日:2001-02-01
申请号:DE19825205
申请日:1998-06-05
Applicant: MOTOROLA INC
Inventor: MILLER COREY ANDREW , KARAALI ORHAN , MASSEY NOEL
-
26.
公开(公告)号:GB2346525A
公开(公告)日:2000-08-09
申请号:GB0009687
申请日:1998-07-20
Applicant: MOTOROLA INC
Inventor: MASSEY NOEL , KARAALI ORHAN , SCHNURR OTTO
Abstract: A method of training a neural network to provide spatial parameters when stimulated by linguistic representation of speech, comprising providing thereto a linguistic representation of speech correlated with spatial parameters and a data storage medium having stored thereon a linguistic representation of speech correlated with spatial parameters for training such neural network.
-
公开(公告)号:GB2326321B
公开(公告)日:1999-08-11
申请号:GB9812479
申请日:1998-06-11
Applicant: MOTOROLA INC
Inventor: MILLER COREY ANDREW , KARAALI ORHAN , MASSEY NOEL
-
28.
公开(公告)号:GB2326320B
公开(公告)日:1999-08-11
申请号:GB9812468
申请日:1998-06-11
Applicant: MOTOROLA INC
Inventor: KARAALI ORHAN , MILLER COREY ANDREW
Abstract: A method (2000), device (2200) and article of manufacture (2300) provide, in response to orthographic information, efficient generation of a phonetic representation. The method provides for, in response to orthographic information, efficient generation of a phonetic representation, using the steps of: inputting an orthography of a word and a predetermined set of input letter features; utilizing a neural network that has been trained using automatic letter phone alignment and predetermined letter features to provide a neural network hypothesis of a word pronunciation.
-
公开(公告)号:AU2104095A
公开(公告)日:1995-11-29
申请号:AU2104095
申请日:1995-03-21
Applicant: MOTOROLA INC
Inventor: KARAALI ORHAN , CORRIGAN GERALD EDWARD , GERSON IRA ALAN
Abstract: Text may be converted to audible signals, such as speech, by first training a neural network 106 using recorded audio messages 204. To begin the training, the recorded audio messages are converted into a series of audio frames 205 having a fixed duration 213. Then, each audio frame is assigned a phonetic representation 203 and a target acoustic representation 208, where the phonetic representation 203 is a binary word that represents the phone and articulation characteristics of the audio frame, while the target acoustic representation 208 is a vector of audio information such as pitch and energy. After training, the neural network 106 is used in conversion of text into speech. First, text that is to be convened is translated to a series of phonetic frames 401 of the same form as the phonetic representations 208 and having the fixed duration 213. Then the neural network produces acoustic representations in response to context descriptions 207 that include some of the phonetic frames 401. The acoustic representations are then converted into a speech wave form by a synthesizer 107.
-
公开(公告)号:FI955608A
公开(公告)日:1995-11-22
申请号:FI955608
申请日:1995-11-22
Applicant: MOTOROLA INC
Inventor: KARAALI ORHAN , CORRIGAN GERALD EDWARD , GERSON IRA ALAN
Abstract: Text may be converted to audible signals, such as speech, by first training a neural network 106 using recorded audio messages 204. To begin the training, the recorded audio messages are converted into a series of audio frames 205 having a fixed duration 213. Then, each audio frame is assigned a phonetic representation 203 and a target acoustic representation 208, where the phonetic representation 203 is a binary word that represents the phone and articulation characteristics of the audio frame, while the target acoustic representation 208 is a vector of audio information such as pitch and energy. After training, the neural network 106 is used in conversion of text into speech. First, text that is to be convened is translated to a series of phonetic frames 401 of the same form as the phonetic representations 208 and having the fixed duration 213. Then the neural network produces acoustic representations in response to context descriptions 207 that include some of the phonetic frames 401. The acoustic representations are then converted into a speech wave form by a synthesizer 107.
-
-
-
-
-
-
-
-
-