-
Publication No.: JPH1173195A
Publication Date: 1999-03-16
Application No.: JP20250898
Filing Date: 1998-07-17
Applicant: LUCENT TECHNOLOGIES INC
Inventor: JUANG BIING-HWANG , LEE CHIN-HUI , LI QI P , ZHOU QIRU
Abstract: PROBLEM TO BE SOLVED: To eliminate the large investment of time and labor required by a training process, by determining a sequence of speech models from a subword transcription of a word sequence, where the subword transcription consists of a sequence of subwords. SOLUTION: Features of a spoken utterance are compared with a sequence of speech models, and a confidence level that the utterance reflects a sequence of one or more words associated with an individual having the proffered identity is determined. One of the sequences of speech models corresponds to speech reflecting the word sequence associated with the individual having the proffered identity. That sequence of speech models is determined from a subword transcription of the word sequence, and the subword transcription consists of a sequence of one or more subwords. A confidence measure module 34 uses a string of target likelihood scores and a string of anti-likelihood scores to determine a confidence measure that the password phrase associated with the individual having the proffered identity is in fact the phrase spoken in the test utterance.
-
Publication No.: JPH1173196A
Publication Date: 1999-03-16
Application No.: JP20250998
Filing Date: 1998-07-17
Applicant: LUCENT TECHNOLOGIES INC
Inventor: JUANG BIING-HWANG , LEE CHIN-HUI , LI QI P , ZHOU QIRU
Abstract: PROBLEM TO BE SOLVED: To eliminate the large investment of time and labor required by a training process, by having one of the sequences of speech models correspond to speech reflecting information associated with an individual having the proffered identity. SOLUTION: Features of a spoken utterance are compared with a sequence of speech models, and a confidence level that the spoken utterance reflects the information associated with an individual having the proffered identity is determined. One of the sequences of speech models corresponds to speech reflecting the information associated with the individual having the proffered identity. A confidence measure module 34 uses a string of target likelihood scores and a string of anti-likelihood scores to determine a confidence measure that the password phrase associated with the individual having the proffered identity is in fact the phrase spoken in the test utterance.
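The confidence decision described in the abstract above can be illustrated with a minimal Python sketch. The abstract does not state how the target and anti-likelihood scores are combined, so the averaged log-likelihood-ratio rule, the function names `confidence_measure` and `verify`, and the default threshold are assumptions made for illustration only, not the patented method.

```python
def confidence_measure(target_scores, anti_scores):
    """Average per-segment log-likelihood ratio (assumed combination rule).

    target_scores: log-likelihoods of each segment under the target models.
    anti_scores:   log-likelihoods of the same segments under the anti-models.
    """
    if not target_scores or len(target_scores) != len(anti_scores):
        raise ValueError("score strings must be non-empty and of equal length")
    # A positive ratio means the target model explains the segment better.
    ratios = [t - a for t, a in zip(target_scores, anti_scores)]
    return sum(ratios) / len(ratios)


def verify(target_scores, anti_scores, threshold=0.0):
    """Accept the proffered identity if the confidence reaches the threshold.

    The threshold value is hypothetical; a real system would tune it.
    """
    return confidence_measure(target_scores, anti_scores) >= threshold
```

For instance, `verify([-10.0, -12.0], [-14.0, -13.0])` accepts, because the target models score higher than the anti-models on both segments.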
-
Publication No.: JP2001159865A
Publication Date: 2001-06-12
Application No.: JP2000272404
Filing Date: 2000-09-08
Applicant: LUCENT TECHNOLOGIES INC
Inventor: AUGUST KATHERINE G , BLACKWOOD NADINE , LI QI P , MCNERNEY MICHELLE , SHIH CHI-LIN , CHANDRASEKARAN SURENDRAN ARUN , ZHONG JIALIN , ZHOU QIRU
IPC: G09B5/06 , G06F3/048 , G06F3/16 , G06Q50/00 , G06T13/00 , G09B5/04 , G09B5/14 , G09B15/00 , G09B19/04 , G09B19/06 , G10L13/00 , G10L13/04 , G10L15/06 , G10L15/22 , G06F3/00 , G06F17/60 , G10L15/00 , G10L21/06
Abstract: PROBLEM TO BE SOLVED: To provide a method and apparatus for interactive language instruction. SOLUTION: The method and apparatus display a text file for processing, provide the basic functions for interactive learning, display facial animation, and provide a workspace for language building functions. The system includes a set of language rules stored as part of a text-to-speech sub-system, and another stored set of rules applied to the process of learning a language. The method implemented by the system includes a step of converting text to audible speech, a step of providing the audible speech to a user or student (together with an animated image when selected), a step of prompting the student to repeat the audible speech, a step of comparing the student's repetition with the audible speech provided by the system, and a step of providing the student with feedback and support, for example by playing back selected portions of the audible speech and of the student's speech to the student.
-
Publication No.: JPH10307593A
Publication Date: 1998-11-17
Application No.: JP6345198
Filing Date: 1998-03-13
Applicant: LUCENT TECHNOLOGIES INC
Inventor: LI QI P
Abstract: PROBLEM TO BE SOLVED: To improve speaker recognition performance by efficiently performing stochastic matching of a set of input test speech data against the corresponding training speech data. SOLUTION: A first covariance matrix representing the stochastic characteristics of the feature information of the input test speech is generated from that feature information. The feature information of the input test speech is then transformed. The transformation is based on the first covariance matrix and on a second covariance matrix representing the stochastic characteristics of the feature information of the training speech. As a result, transformed feature information of the input test speech is obtained whose stochastic characteristics are well matched to those of the feature information of the training speech. The transformation is preferably also based on the stochastic mean of the feature information of the training speech.
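One way to picture the covariance-based transformation in the abstract above is a whitening-then-coloring transform, sketched below in Python with NumPy. The abstract does not give the actual formula, so this construction, the function names `sym_power` and `match_to_training`, and the use of symmetric matrix square roots are assumptions for illustration, not the patented procedure.

```python
import numpy as np


def sym_power(mat, power):
    """Raise a symmetric positive-definite matrix to a real power."""
    vals, vecs = np.linalg.eigh(mat)
    # Scale each eigenvector column by its eigenvalue raised to `power`.
    return (vecs * vals**power) @ vecs.T


def match_to_training(test_feats, train_cov, train_mean):
    """Transform test features so their sample mean and covariance
    equal the training mean and covariance (assumed construction).

    test_feats: array of shape (num_frames, dim), one feature vector per row.
    train_cov:  (dim, dim) covariance of the training features.
    train_mean: (dim,) mean of the training features.
    """
    test_mean = test_feats.mean(axis=0)
    test_cov = np.cov(test_feats, rowvar=False)  # the "first" covariance matrix
    # Whiten with test_cov^(-1/2), then color with train_cov^(1/2).
    transform = sym_power(train_cov, 0.5) @ sym_power(test_cov, -0.5)
    return (test_feats - test_mean) @ transform.T + train_mean
```

The sketch assumes the test covariance is non-singular, which requires more frames than feature dimensions.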
-
Publication No.: CA2239339A1
Publication Date: 1999-01-18
Application No.: CA2239339
Filing Date: 1998-05-29
Applicant: LUCENT TECHNOLOGIES INC
Inventor: LEE CHIN-HUI , JUANG BIING-HWANG , LI QI P , ZHOU QIRU
Abstract: A method and apparatus for authenticating a proffered identity of a speaker in which the verbal information content of a speaker's utterance, rather than the vocal characteristics of the speaker, is used to identify or verify the identity of a speaker. Specifically, features of a speech utterance spoken by a speaker are compared with at least one sequence of speaker-independent speech models, where one of these sequences of speech models corresponds to speech reflecting information associated with an individual having said proffered identity. Then, a confidence level that the speech utterance in fact reflects the information associated with the individual having said proffered identity is determined based on said comparison. In accordance with one illustrative embodiment, the proffered identity is an identity claimed by the speaker, and the claimed identity is verified based upon the determined confidence level. In accordance with another illustrative embodiment, each of a plurality of proffered identities is checked in turn to identify the speaker as being a particular one of a corresponding plurality of individuals. The features of the speech utterance may comprise cepstral (i.e., frequency) domain data, and the speaker-independent speech models may comprise Hidden Markov Models of individual phonemes. Since speaker-independent models are employed, the need for each system user to perform an individual training session is eliminated.
-
Publication No.: CA2317359A1
Publication Date: 2001-03-09
Application No.: CA2317359
Filing Date: 2000-09-05
Applicant: LUCENT TECHNOLOGIES INC
Inventor: AUGUST KATHERINE G , SURENDRAN ARUN CHANDRASEKARAN , MCNERNEY MICHELLE , SHIH CHI-LIN , LI QI P , ZHOU QIRU , ZHONG JIALIN , BLACKWOOD NADINE
IPC: G09B5/06 , G06F3/048 , G06F3/16 , G06Q50/00 , G06T13/00 , G09B5/04 , G09B5/14 , G09B15/00 , G09B19/04 , G09B19/06 , G10L13/00 , G10L13/04 , G10L15/06 , G10L15/22 , G10L11/00
Abstract: A method and apparatus for interactive language instruction is provided that displays text files for processing, provides key features and functions for interactive learning, displays facial animation, and provides a workspace for language building functions. The system includes a stored set of language rules as part of the text-to-speech sub-system, as well as another stored set of rules as applied to the process of learning a language. The method implemented by the system includes digitally converting text to audible speech, providing the audible speech to a user or student (with the aid of an animated image in selected circumstances), prompting the student to replicate the audible speech, comparing the student's replication with the audible speech provided by the system, and providing feedback and reinforcement to the student by, for example, selectively recording or playing back the audible speech and the student's replication.
-
Publication No.: CA2317359C
Publication Date: 2006-11-07
Application No.: CA2317359
Filing Date: 2000-09-05
Applicant: LUCENT TECHNOLOGIES INC
Inventor: ZHOU QIRU , MCNERNEY MICHELLE , ZHONG JIALIN , BLACKWOOD NADINE , LI QI P , SHIH CHI-LIN , AUGUST KATHERINE G , SURENDRAN ARUN CHANDRASEKARAN
IPC: G09B5/04 , G09B5/06 , G06F3/048 , G06F3/16 , G06Q50/00 , G06T13/00 , G09B5/14 , G09B15/00 , G09B19/04 , G09B19/06 , G10L13/00 , G10L13/04 , G10L15/06 , G10L15/22
Abstract: A method and apparatus for interactive language instruction is provided that displays text files for processing, provides key features and functions for interactive learning, displays facial animation, and provides a workspace for language building functions. The system includes a stored set of language rules as part of the text-to-speech sub-system, as well as another stored set of rules as applied to the process of learning a language. The method implemented by the system includes digitally converting text to audible speech, providing the audible speech to a user or student (with the aid of an animated image in selected circumstances), prompting the student to replicate the audible speech, comparing the student's replication with the audible speech provided by the system, and providing feedback and reinforcement to the student by, for example, selectively recording or playing back the audible speech and the student's replication.
-
Publication No.: CA2239339C
Publication Date: 2002-04-16
Application No.: CA2239339
Filing Date: 1998-05-29
Applicant: LUCENT TECHNOLOGIES INC
Inventor: LI QI P , ZHOU QIRU , LEE CHIN-HUI , JUANG BIING-HWANG
Abstract: A method and apparatus for authenticating a proffered identity of a speaker in which the verbal information content of a speaker's utterance, rather than the vocal characteristics of the speaker, is used to identify or verify the identity of a speaker. Specifically, features of a speech utterance spoken by a speaker are compared with at least one sequence of speaker-independent speech models, where one of these sequences of speech models corresponds to speech reflecting information associated with an individual having said proffered identity. Then, a confidence level that the speech utterance in fact reflects the information associated with the individual having said proffered identity is determined based on said comparison. In accordance with one illustrative embodiment, the proffered identity is an identity claimed by the speaker, and the claimed identity is verified based upon the determined confidence level. In accordance with another illustrative embodiment, each of a plurality of proffered identities is checked in turn to identify the speaker as being a particular one of a corresponding plurality of individuals. The features of the speech utterance may comprise cepstral (i.e., frequency) domain data, and the speaker-independent speech models may comprise Hidden Markov Models of individual phonemes. Since speaker-independent models are employed, the need for each system user to perform an individual training session is eliminated.
-
Publication No.: DE69800320D1
Publication Date: 2000-10-26
Application No.: DE69800320
Filing Date: 1998-07-07
Applicant: LUCENT TECHNOLOGIES INC
Inventor: JUANG BIING-HWANG , LI QI P , LEE CHIN-HUI , ZHOU QIRU
-
Publication No.: DE69800320T2
Publication Date: 2001-05-10
Application No.: DE69800320
Filing Date: 1998-07-07
Applicant: LUCENT TECHNOLOGIES INC
Inventor: JUANG BIING-HWANG , LI QI P , LEE CHIN-HUI , ZHOU QIRU