1.
    Invention patent
    Unknown

    Publication No.: FI955608A0

    Publication date: 1995-11-22

    Application No.: FI955608

    Filing date: 1995-11-22

    Applicant: MOTOROLA INC

    Abstract: Text may be converted to audible signals, such as speech, by first training a neural network 106 using recorded audio messages 204. To begin the training, the recorded audio messages are converted into a series of audio frames 205 having a fixed duration 213. Then, each audio frame is assigned a phonetic representation 203 and a target acoustic representation 208, where the phonetic representation 203 is a binary word that represents the phone and articulation characteristics of the audio frame, while the target acoustic representation 208 is a vector of audio information such as pitch and energy. After training, the neural network 106 is used in conversion of text into speech. First, text that is to be converted is translated to a series of phonetic frames 401 of the same form as the phonetic representations 203 and having the fixed duration 213. Then the neural network produces acoustic representations in response to context descriptions 207 that include some of the phonetic frames 401. The acoustic representations are then converted into a speech wave form by a synthesizer 107.

    A method and apparatus for converting text into audible signals using a neural network

    Publication No.: AU675389B2

    Publication date: 1997-01-30

    Application No.: AU2104095

    Filing date: 1995-03-21

    Applicant: MOTOROLA INC

    Abstract: Text may be converted to audible signals, such as speech, by first training a neural network 106 using recorded audio messages 204. To begin the training, the recorded audio messages are converted into a series of audio frames 205 having a fixed duration 213. Then, each audio frame is assigned a phonetic representation 203 and a target acoustic representation 208, where the phonetic representation 203 is a binary word that represents the phone and articulation characteristics of the audio frame, while the target acoustic representation 208 is a vector of audio information such as pitch and energy. After training, the neural network 106 is used in conversion of text into speech. First, text that is to be converted is translated to a series of phonetic frames 401 of the same form as the phonetic representations 203 and having the fixed duration 213. Then the neural network produces acoustic representations in response to context descriptions 207 that include some of the phonetic frames 401. The acoustic representations are then converted into a speech wave form by a synthesizer 107.

    A Method and Apparatus for Converting Text Into Audible Signals Using a Neural Network

    Publication No.: CA2161540A1

    Publication date: 1995-11-09

    Application No.: CA2161540

    Filing date: 1995-03-21

    Applicant: MOTOROLA INC

    Abstract: Text may be converted to audible signals, such as speech, by first training a neural network 106 using recorded audio messages 204. To begin the training, the recorded audio messages are converted into a series of audio frames 205 having a fixed duration 213. Then, each audio frame is assigned a phonetic representation 203 and a target acoustic representation 208, where the phonetic representation 203 is a binary word that represents the phone and articulation characteristics of the audio frame, while the target acoustic representation 208 is a vector of audio information such as pitch and energy. After training, the neural network 106 is used in conversion of text into speech. First, text that is to be converted is translated to a series of phonetic frames 401 of the same form as the phonetic representations 203 and having the fixed duration 213. Then the neural network produces acoustic representations in response to context descriptions 207 that include some of the phonetic frames 401. The acoustic representations are then converted into a speech wave form by a synthesizer 107.

    A method and apparatus for converting text into audible signals using a neural network

    Publication No.: AU2104095A

    Publication date: 1995-11-29

    Application No.: AU2104095

    Filing date: 1995-03-21

    Applicant: MOTOROLA INC

    Abstract: Text may be converted to audible signals, such as speech, by first training a neural network 106 using recorded audio messages 204. To begin the training, the recorded audio messages are converted into a series of audio frames 205 having a fixed duration 213. Then, each audio frame is assigned a phonetic representation 203 and a target acoustic representation 208, where the phonetic representation 203 is a binary word that represents the phone and articulation characteristics of the audio frame, while the target acoustic representation 208 is a vector of audio information such as pitch and energy. After training, the neural network 106 is used in conversion of text into speech. First, text that is to be converted is translated to a series of phonetic frames 401 of the same form as the phonetic representations 203 and having the fixed duration 213. Then the neural network produces acoustic representations in response to context descriptions 207 that include some of the phonetic frames 401. The acoustic representations are then converted into a speech wave form by a synthesizer 107.

    5.
    Invention patent
    Unknown

    Publication No.: FI955608A

    Publication date: 1995-11-22

    Application No.: FI955608

    Filing date: 1995-11-22

    Applicant: MOTOROLA INC

    Abstract: Text may be converted to audible signals, such as speech, by first training a neural network 106 using recorded audio messages 204. To begin the training, the recorded audio messages are converted into a series of audio frames 205 having a fixed duration 213. Then, each audio frame is assigned a phonetic representation 203 and a target acoustic representation 208, where the phonetic representation 203 is a binary word that represents the phone and articulation characteristics of the audio frame, while the target acoustic representation 208 is a vector of audio information such as pitch and energy. After training, the neural network 106 is used in conversion of text into speech. First, text that is to be converted is translated to a series of phonetic frames 401 of the same form as the phonetic representations 203 and having the fixed duration 213. Then the neural network produces acoustic representations in response to context descriptions 207 that include some of the phonetic frames 401. The acoustic representations are then converted into a speech wave form by a synthesizer 107.

    A METHOD AND APPARATUS FOR CONVERTING TEXT INTO AUDIBLE SIGNALS USING A NEURAL NETWORK
    6.
    Invention publication
    A METHOD AND APPARATUS FOR CONVERTING TEXT INTO AUDIBLE SIGNALS USING A NEURAL NETWORK (Expired)

    Publication No.: EP0710378A4

    Publication date: 1998-04-01

    Application No.: EP95913782

    Filing date: 1995-03-21

    Applicant: MOTOROLA INC

    CPC classification number: G10L13/08 G10L25/30

    Abstract: Text may be converted to audible signals, such as speech, by first training a neural network using recorded audio messages (204). To begin the training, the recorded audio messages are converted into a series of audio frames (205) having a fixed duration (213). Then, each audio frame is assigned a phonetic representation (203) and a target acoustic representation, where the phonetic representation (203) is a binary word that represents the phone and articulation characteristics of the audio frame, while the target acoustic representation is a vector of audio information such as pitch and energy. After training, the neural network is used in conversion of text into speech. First, text that is to be converted is translated to a series of phonetic frames of the same form as the phonetic representations (203) and having the fixed duration (213). Then the neural network produces acoustic representations in response to context descriptions (207) that include some of the phonetic frames. The acoustic representations are then converted into a speech wave form by a synthesizer.
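
    The data preparation described in the abstract (fixed-duration frames, a binary phonetic word per frame, and context descriptions built from neighboring phonetic frames) can be illustrated with a minimal Python sketch. It is not the patented implementation: the phone inventory, the articulation features, the 5 ms frame duration, the context radius, and every function name below are hypothetical choices made only for illustration, and the neural network and synthesizer are omitted.

```python
from typing import List, Sequence, Tuple

# Hypothetical phone inventory and articulation features (not from the patent).
PHONES = ["sil", "k", "ae", "t"]
FEATURES = ["voiced", "vowel", "stop"]
PHONE_FEATURES = {
    "sil": set(),
    "k": {"stop"},
    "ae": {"voiced", "vowel"},
    "t": {"stop"},
}


def split_into_frames(samples: Sequence[float], sample_rate: int,
                      frame_ms: int) -> List[List[float]]:
    """Cut recorded audio into consecutive frames of one fixed duration.

    Trailing samples that do not fill a whole frame are dropped.
    """
    frame_len = sample_rate * frame_ms // 1000
    if frame_len <= 0:
        raise ValueError("frame duration too short for this sample rate")
    count = len(samples) // frame_len
    return [list(samples[i * frame_len:(i + 1) * frame_len])
            for i in range(count)]


def phonetic_word(phone: str) -> List[int]:
    """Binary word: one-hot phone identity followed by articulation bits."""
    one_hot = [1 if p == phone else 0 for p in PHONES]
    bits = [1 if f in PHONE_FEATURES[phone] else 0 for f in FEATURES]
    return one_hot + bits


def phones_to_frames(phones: Sequence[Tuple[str, int]],
                     frame_ms: int) -> List[List[int]]:
    """Expand (phone, duration in ms) pairs into fixed-duration phonetic frames."""
    frames: List[List[int]] = []
    for phone, duration_ms in phones:
        count = max(1, round(duration_ms / frame_ms))
        frames.extend(phonetic_word(phone) for _ in range(count))
    return frames


def context_description(frames: Sequence[Sequence[int]], index: int,
                        radius: int) -> List[int]:
    """Concatenate the frames around `index`; positions outside the
    utterance are filled with all-zero words."""
    width = len(PHONES) + len(FEATURES)
    context: List[int] = []
    for j in range(index - radius, index + radius + 1):
        if 0 <= j < len(frames):
            context.extend(frames[j])
        else:
            context.extend([0] * width)
    return context
```

    In such a setup, each context description would be the network input for one frame, and the network would be trained to output that frame's target acoustic vector (for example pitch and energy) before a synthesizer turns the predicted vectors into a waveform.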
