-
Publication No.: JPH1173195A
Publication date: 1999-03-16
Application No.: JP20250898
Filing date: 1998-07-17
Applicant: LUCENT TECHNOLOGIES INC
Inventor: JUANG BIING-HWANG , LEE CHIN-HUI , LI QI P , ZHOU QIRU
Abstract: PROBLEM TO BE SOLVED: To eliminate the need for a large investment of time and labor in a training process, by determining one sequence of speech models from a subword transcription of a word sequence, where the subword transcription consists of a sequence of subwords. SOLUTION: Features of a spoken utterance are compared with a sequence of speech models, and a confidence level is determined that the utterance reflects a sequence of one or more words associated with an individual having a proposed identity. One of the speech model sequences corresponds to speech reflecting the word sequence associated with that individual. That sequence is determined from a subword transcription of the word sequence, and the subword transcription consists of a sequence of one or more subwords. A confidence measure module 34 uses a string of target likelihood scores and a string of anti-likelihood scores to determine a confidence measure that the password phrase associated with the individual having the proposed identity is actually the phrase of the test utterance.
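Illustrative sketch (not taken from the patent text): one way a string of target likelihood scores and a string of anti-likelihood scores might be combined into a single confidence measure is to count the segments whose log-likelihood ratio exceeds a threshold. The function names, the per-segment score format, and both threshold values are assumptions made for illustration only.

```python
from typing import Sequence


def confidence_measure(
    target_scores: Sequence[float],
    anti_scores: Sequence[float],
    segment_threshold: float = 0.0,
) -> float:
    """Return the fraction of segments whose target score beats the anti score.

    Both inputs are per-segment log-likelihoods (hypothetical format): one
    from the model of the expected password phrase, one from an anti-model.
    """
    if len(target_scores) != len(anti_scores) or not target_scores:
        raise ValueError("score strings must be non-empty and equal length")
    passed = 0
    for target, anti in zip(target_scores, anti_scores):
        # Log-likelihood ratio: positive when the target model fits better.
        if target - anti > segment_threshold:
            passed += 1
    return passed / len(target_scores)


def accept_phrase(confidence: float, decision_threshold: float = 0.8) -> bool:
    """Accept the test utterance as the expected phrase (assumed threshold)."""
    return confidence >= decision_threshold
```

With four segments of which three favor the target model, `confidence_measure` returns 0.75, which the assumed decision threshold of 0.8 would reject.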
-
Publication No.: JP2000187496A
Publication date: 2000-07-04
Application No.: JP33892899
Filing date: 1999-11-30
Applicant: LUCENT TECHNOLOGIES INC
Inventor: CHOU WU , RECCHIONE MICHAEL CHARLES , ZHOU QIRU
Abstract: PROBLEM TO BE SOLVED: To provide a method for advanced automatic speech recognition that needs neither a secondary channel nor significant changes to existing standards, by extracting a plurality of speech feature signals from a received coded speech signal, separately from converting the received coded speech signal into an audio speech signal, and applying them to a speech recognition system. SOLUTION: In response to a user's spoken input, a handset 201 generates speech coder parameters and transmits them to a radio base station 220. At the radio base station 220, the received coded speech signal is provided on a path to a public switch 230. The coded speech signal received by the radio base station 220 is also supplied to an automatic speech/speaker recognition (ASR) feature system 235 and an ASR system 240 located at the radio base station 220. In place of the decoded speech signal, these systems use the same coded representation that is decoded to produce the speech signal, and the ASR feature system 235 extracts the ASR feature parameters used by the ASR system 240.
-
Publication No.: JP2001159865A
Publication date: 2001-06-12
Application No.: JP2000272404
Filing date: 2000-09-08
Applicant: LUCENT TECHNOLOGIES INC
Inventor: AUGUST KATHERINE G , BLACKWOOD NADINE , LI QI P , MCNERNEY MICHELLE , SHIH CHI-LIN , CHANDRASEKARAN SURENDRAN ARUN , ZHONG JIALIN , ZHOU QIRU
IPC: G09B5/06 , G06F3/048 , G06F3/16 , G06Q50/00 , G06T13/00 , G09B5/04 , G09B5/14 , G09B15/00 , G09B19/04 , G09B19/06 , G10L13/00 , G10L13/04 , G10L15/06 , G10L15/22 , G06F3/00 , G06F17/60 , G10L15/00 , G10L21/06
Abstract: PROBLEM TO BE SOLVED: To provide a method and apparatus for interactive language instruction. SOLUTION: The method and apparatus display text files for processing, provide basic functions for interactive learning, display facial animation, and provide a workspace for language-building functions. The system includes one set of language rules stored as part of a text-to-speech subsystem and another stored set of rules applied to the process of learning a language. The method implemented by the system includes the steps of converting text to audible speech, providing the audible speech to a user or student (together with an animated image when selected), prompting the student to repeat the audible speech, comparing the student's repetition with the audible speech provided by the system, and providing feedback and support to the student, for example by selectively playing back the audible speech and the student's own speech to the student.
-
Publication No.: JPH1173196A
Publication date: 1999-03-16
Application No.: JP20250998
Filing date: 1998-07-17
Applicant: LUCENT TECHNOLOGIES INC
Inventor: JUANG BIING-HWANG , LEE CHIN-HUI , LI QI P , ZHOU QIRU
Abstract: PROBLEM TO BE SOLVED: To eliminate the need for a large investment of time and labor in a training process, by having one of the speech model sequences correspond to speech reflecting information associated with an individual having a proposed identity. SOLUTION: Features of a spoken utterance are compared with a sequence of speech models, and a confidence level is determined that the spoken utterance reflects the information associated with the individual having the proposed identity. One of the speech model sequences corresponds to speech reflecting the information associated with that individual. A confidence measure module 34 uses a string of target likelihood scores and a string of anti-likelihood scores to determine a confidence measure that the password phrase associated with the individual having the proposed identity is actually the phrase of the test utterance.
-
Publication No.: CA2239339A1
Publication date: 1999-01-18
Application No.: CA2239339
Filing date: 1998-05-29
Applicant: LUCENT TECHNOLOGIES INC
Inventor: LEE CHIN-HUI , JUANG BIING-HWANG , LI QI P , ZHOU QIRU
Abstract: A method and apparatus for authenticating a proffered identity of a speaker in which the verbal information content of a speaker's utterance, rather than the vocal characteristics of the speaker, is used to identify or verify the identity of a speaker. Specifically, features of a speech utterance spoken by a speaker are compared with at least one sequence of speaker-independent speech models, where one of these sequences of speech models corresponds to speech reflecting information associated with an individual having said proffered identity. Then, a confidence level that the speech utterance in fact reflects the information associated with the individual having said proffered identity is determined based on said comparison. In accordance with one illustrative embodiment, the proffered identity is an identity claimed by the speaker, and the claimed identity is verified based upon the determined confidence level. In accordance with another illustrative embodiment, each of a plurality of proffered identities is checked in turn to identify the speaker as being a particular one of a corresponding plurality of individuals. The features of the speech utterance may comprise cepstral (i.e., frequency) domain data, and the speaker-independent speech models may comprise Hidden Markov Models of individual phonemes. Since speaker-independent models are employed, the need for each system user to perform an individual training session is eliminated.
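Illustrative sketch (not taken from the patent text): the second embodiment, in which each proffered identity is checked in turn, could be organized as a loop over candidate identities. The scorer `confidence_for`, the threshold, and the choice to stop at the first accepted candidate (rather than pick the best-scoring one) are all assumptions for illustration.

```python
from typing import Callable, Iterable, Optional


def identify_speaker(
    candidates: Iterable[str],
    confidence_for: Callable[[str], float],
    threshold: float = 0.8,
) -> Optional[str]:
    """Check each candidate identity in turn; return the first one accepted."""
    for identity in candidates:
        # confidence_for is a hypothetical scorer: how well the utterance
        # matches the information on file for this identity.
        if confidence_for(identity) >= threshold:
            return identity
    return None
```

Verification of a single claimed identity is the special case of a one-element candidate list.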
-
Publication No.: CA2317359A1
Publication date: 2001-03-09
Application No.: CA2317359
Filing date: 2000-09-05
Applicant: LUCENT TECHNOLOGIES INC
Inventor: AUGUST KATHERINE G , SURENDRAN ARUN CHANDRASEKARAN , MCNERNEY MICHELLE , SHIH CHI-LIN , LI QI P , ZHOU QIRU , ZHONG JIALIN , BLACKWOOD NADINE
IPC: G09B5/06 , G06F3/048 , G06F3/16 , G06Q50/00 , G06T13/00 , G09B5/04 , G09B5/14 , G09B15/00 , G09B19/04 , G09B19/06 , G10L13/00 , G10L13/04 , G10L15/06 , G10L15/22 , G10L11/00
Abstract: A method and apparatus for interactive language instruction is provided that displays text files for processing, provides key features and functions for interactive learning, displays facial animation, and provides a workspace for language building functions. The system includes a stored set of language rules as part of the text-to-speech sub-system, as well as another stored set of rules as applied to the process of learning a language. The method implemented by the system includes digitally converting text to audible speech, providing the audible speech to a user or student (with the aid of an animated image in selected circumstances), prompting the student to replicate the audible speech, comparing the student's replication with the audible speech provided by the system, and providing feedback and reinforcement to the student by, for example, selectively recording or playing back the audible speech and the student's replication.
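Illustrative sketch (not taken from the patent text): the abstract does not say how the student's replication is compared with the system's speech. Dynamic time warping (DTW) is one common way to compare two utterances of different lengths and is used here purely as an assumed stand-in. The one-dimensional feature tracks, the function names, and the tolerance value are hypothetical.

```python
from typing import Sequence


def dtw_distance(reference: Sequence[float], attempt: Sequence[float]) -> float:
    """Dynamic time warping distance between two 1-D feature tracks."""
    if not reference or not attempt:
        raise ValueError("both sequences must be non-empty")
    inf = float("inf")
    rows, cols = len(reference), len(attempt)
    # cost[i][j] = best cumulative cost aligning reference[:i] with attempt[:j]
    cost = [[inf] * (cols + 1) for _ in range(rows + 1)]
    cost[0][0] = 0.0
    for i in range(1, rows + 1):
        for j in range(1, cols + 1):
            local = abs(reference[i - 1] - attempt[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # attempt frame reused for another reference frame
                cost[i][j - 1],      # reference frame reused for another attempt frame
                cost[i - 1][j - 1],  # frames matched one-to-one
            )
    return cost[rows][cols]


def needs_retry(
    reference: Sequence[float], attempt: Sequence[float], tolerance: float = 1.0
) -> bool:
    """Return True when the attempt differs enough to prompt a repeat."""
    return dtw_distance(reference, attempt) > tolerance
```

A slower rendition of the same track, such as `[1, 2, 2, 3]` against `[1, 2, 3]`, aligns at zero cost, so timing differences alone would not trigger a retry under this measure.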
-
Publication No.: DE69800320T2
Publication date: 2001-05-10
Application No.: DE69800320
Filing date: 1998-07-07
Applicant: LUCENT TECHNOLOGIES INC
Inventor: JUANG BIING-HWANG , LI QI P , LEE CHIN-HUI , ZHOU QIRU
-
Publication No.: CA2239340A1
Publication date: 1999-01-18
Application No.: CA2239340
Filing date: 1998-05-29
Applicant: LUCENT TECHNOLOGIES INC
Inventor: ZHOU QIRU , JUANG BIING-HWANG , LI QI P , LEE CHIN-HUI
Abstract: A method and apparatus for authenticating a proffered identity of a speaker in which the verbal information content of a speaker's utterance, rather than the vocal characteristics of the speaker, is used to identify or verify the identity of a speaker. Specifically, features of a speech utterance spoken by a speaker are compared with at least one sequence of speaker-independent speech models, where one of these sequences of speech models corresponds to speech reflecting information associated with an individual having said proffered identity. Then, a confidence level that the speech utterance in fact reflects the information associated with the individual having said proffered identity is determined based on said comparison. In accordance with one illustrative embodiment, the proffered identity is an identity claimed by the speaker, and the claimed identity is verified based upon the determined confidence level. In accordance with another illustrative embodiment, each of a plurality of proffered identities is checked in turn to identify the speaker as being a particular one of a corresponding plurality of individuals. The features of the speech utterance may comprise cepstral (i.e., frequency) domain data, and the speaker-independent speech models may comprise Hidden Markov Models of individual phonemes. Since speaker-independent models are employed, the need for each system user to perform an individual training session is eliminated.
-
Publication No.: CA2317359C
Publication date: 2006-11-07
Application No.: CA2317359
Filing date: 2000-09-05
Applicant: LUCENT TECHNOLOGIES INC
Inventor: ZHOU QIRU , MCNERNEY MICHELLE , ZHONG JIALIN , BLACKWOOD NADINE , LI QI P , SHIH CHI-LIN , AUGUST KATHERINE G , SURENDRAN ARUN CHANDRASEKARAN
IPC: G09B5/04 , G09B5/06 , G06F3/048 , G06F3/16 , G06Q50/00 , G06T13/00 , G09B5/14 , G09B15/00 , G09B19/04 , G09B19/06 , G10L13/00 , G10L13/04 , G10L15/06 , G10L15/22
Abstract: A method and apparatus for interactive language instruction is provided that displays text files for processing, provides key features and functions for interactive learning, displays facial animation, and provides a workspace for language building functions. The system includes a stored set of language rules as part of the text-to-speech sub-system, as well as another stored set of rules as applied to the process of learning a language. The method implemented by the system includes digitally converting text to audible speech, providing the audible speech to a user or student (with the aid of an animated image in selected circumstances), prompting the student to replicate the audible speech, comparing the student's replication with the audible speech provided by the system, and providing feedback and reinforcement to the student by, for example, selectively recording or playing back the audible speech and the student's replication.
-
Publication No.: DE69911723T2
Publication date: 2004-08-12
Application No.: DE69911723
Filing date: 1999-11-23
Applicant: LUCENT TECHNOLOGIES INC
Inventor: CHOU WU , RECCHIONE MICHAEL CHARLES , ZHOU QIRU
Abstract: Automatic Speech Recognition (ASR) is achieved in wireless communications systems in which reliable ASR feature vector sequences are derived at a base station directly from digitally transmitted speech coder parameters, with no additional processing or signal modification required at the originating handset. No secondary channel need be provided for the transmission of ASR feature vectors. In operating on received speech coder parameters prior to conversion to a voice signal, the present system and methods avoid the lossy conversion process and associated voice distortion. Since the received voice parameters are error protected during transmission, they are received with greater accuracy.
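Illustrative sketch (not taken from the patent text): the abstract does not state which coder parameters or which ASR features are involved. Assuming, for illustration only, that the coder transmits linear-prediction (LPC) coefficients and that the recognizer uses cepstral features, the standard LPC-to-cepstrum recursion shows how features can be computed from the parameters without first reconstructing a waveform. The function name and the sign convention are assumptions.

```python
from typing import List, Sequence


def lpc_to_cepstrum(lpc: Sequence[float], n_ceps: int) -> List[float]:
    """Cepstral coefficients c_1..c_n of the filter 1 / (1 - sum_k a_k z^-k).

    `lpc` holds a_1..a_p in that sign convention (an assumption; coders differ).
    """
    p = len(lpc)
    ceps: List[float] = []
    for n in range(1, n_ceps + 1):
        # Direct term a_n exists only while n is within the predictor order.
        acc = lpc[n - 1] if n <= p else 0.0
        for k in range(max(1, n - p), n):
            # ceps[k - 1] is c_k; lpc[n - k - 1] is a_{n-k}
            acc += (k / n) * ceps[k - 1] * lpc[n - k - 1]
        ceps.append(acc)
    return ceps
```

For a single-pole filter with a_1 = 0.5, the recursion reproduces the closed form c_n = 0.5^n / n, which is a convenient sanity check.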
-