-
公开(公告)号:CA2068780C
公开(公告)日:1998-12-22
申请号:CA2068780
申请日:1992-05-15
Applicant: IBM
Inventor: BROWN PETER F , COCKE JOHN , DELLA PIETRA STEPHEN A , DELLA PIETRA VINCENT J , JELINEK FREDERICK , LAI JENNIFER C , MERCER ROBERT L
Abstract: The present invention is a system for translating text from a first source language into second target language. The system assigns probabilities or scores to various target-language translations and then displays or makes otherwise available the highest, scoring translations. The source text is first transduced into one or more intermediate structural representations. From these intermediate source structures a set of intermediate target-structure hypotheses is generated. These hypotheses are scored by two different models: a language model which assigns a probability or score to an intermediate target structure, and a translation model which assigns a probability or score to the event that an intermediate target structure is translated into an intermediate source structure. Scores from the translation model and language model are combined into a combined score for each intermediate target-structure hypothesis. Finally, a set of target-text hypotheses is produced by transducing the highest scoring target-structure hypotheses into portions of text into the target language. The system can either run into batch mode, in which case it translates source-language text into a target language without human assistance, or it can function as an aid to a human translator. When functioning as an aid to a human translator, the human may simply select from the various translation hypotheses provided by the system, or he may optionally provide hints or constraints on how to perform one or more of the states of source transduction, hypothesis generation and target transduction.
-
公开(公告)号:DE69315374D1
公开(公告)日:1998-01-08
申请号:DE69315374
申请日:1993-01-15
Applicant: IBM
Inventor: BROWN PETER FITZHUGH , DELLA PIETRA STEPHEN ANDREW , DELLA PIETRA VINCENT JOSEPH , MERCER ROBERT LEROY , JELINEK FREDERICK
Abstract: A speech recognition system displays a source text of one or more words in a source language. The system has an acoustic processor for generating a sequence of coded representations of an utterance to be recognized. The utterance comprises a series of one or more words in a target language different from the source language. A set of one or more speech hypotheses, each comprising one or more words from the target language, are produced. Each speech hypothesis is modeled with an acoustic model. An acoustic match score for each speech hypothesis comprises an estimate of the closeness of a match between the acoustic model of the speech hypothesis and the sequence of coded representations of the utterance. A translation match score for each speech hypothesis comprises an estimate of the probability of occurrence of the speech hypothesis given the occurrence of the source text. A hypothesis score for each hypothesis comprises a combination of the acoustic match score and the translation match score. At least one word of one or more speech hypotheses having the best hypothesis scores is output as a recognition result.
-
公开(公告)号:DE69315374T2
公开(公告)日:1998-05-28
申请号:DE69315374
申请日:1993-01-15
Applicant: IBM
Inventor: BROWN PETER FITZHUGH , DELLA PIETRA STEPHEN ANDREW , DELLA PIETRA VINCENT JOSEPH , MERCER ROBERT LEROY , JELINEK FREDERICK
Abstract: A speech recognition system displays a source text of one or more words in a source language. The system has an acoustic processor for generating a sequence of coded representations of an utterance to be recognized. The utterance comprises a series of one or more words in a target language different from the source language. A set of one or more speech hypotheses, each comprising one or more words from the target language, are produced. Each speech hypothesis is modeled with an acoustic model. An acoustic match score for each speech hypothesis comprises an estimate of the closeness of a match between the acoustic model of the speech hypothesis and the sequence of coded representations of the utterance. A translation match score for each speech hypothesis comprises an estimate of the probability of occurrence of the speech hypothesis given the occurrence of the source text. A hypothesis score for each hypothesis comprises a combination of the acoustic match score and the translation match score. At least one word of one or more speech hypotheses having the best hypothesis scores is output as a recognition result.
-
公开(公告)号:DE3681155D1
公开(公告)日:1991-10-02
申请号:DE3681155
申请日:1986-03-27
Applicant: IBM
Inventor: BAHL LALIT RAI , JELINEK FREDERICK , MERCER ROBERT LEROY
Abstract: A speech recognition system has an acoustic processor whic-generates a string of acoustic labels in response to speech input and a decoder which matches words in a vocabulary against generated labels in a string. A string of labels are generated in response to a speech input, and selecting words from a vocabulary as possible first words corresp. to labels at the beginning of the string. For a subject selected word, a most likely boundary label interval in the string is located where the subject selected word has the highest probability of ending. A respective likelihood of the subject selected word at each label interval of the string up to and including the most likely boundary label interval is evaluated. The process is repeated for each selected word as the subject selected word. A given selected word is classified as extendible if the likelihood at the partic. label interval corresp. to the most likely boundary label interval is within a predefined range of the highest likelihood for any selected word the partic. label interval.
-
公开(公告)号:CA2091912C
公开(公告)日:1996-12-03
申请号:CA2091912
申请日:1993-03-18
Applicant: IBM
Inventor: BROWN PETER F , DELLA PIETRA STEPHEN A , DELLA PIETRA VINCENT J , JELINEK FREDERICK , MERCER ROBERT L
Abstract: A speech recognition system displays a source text of one or more words in a source language. The system has an acoustic processor for generating a sequence of coded representations of an utterance to be recognized. The utterance comprises a series of one or more words in a target language different from the source language. A set of one or more speech hypotheses, each comprising one or more words from the target language, are produced. Each speech hypothesis is modeled with an acoustic model. An acoustic match score for each speech hypothesis comprises an estimate of the closeness of a match between the acoustic model of the speech hypothesis and the sequence of coded representations of the utterance. A translation match score for each speech hypothesis comprises an estimate of the probability of occurrence of the speech hypothesis given the occurrence of the source text. A hypothesis score for each hypothesis comprises a combination of the acoustic match score and the translation match score. At least one word of one or more speech hypotheses having the best hypothesis scores is output as a recognition result.
-
公开(公告)号:CA1248633A
公开(公告)日:1989-01-10
申请号:CA504800
申请日:1986-03-24
Applicant: IBM
Inventor: BAHL LALIT R , JELINEK FREDERICK , MERCER ROBERT L
Abstract: APPARATUS AND METHOD FOR DETERMINING A LIKELY WORD SEQUENCE FROM LABELS GENERATED BY AN AN ACOUSTIC PROCESSOR The present invention addresses the problem of determining, in a speech recognition context, a likely sequence or path of words from a plurality of word paths given a string of labels that are generated at successive intervals. The invention features multiple stack decoding and a unique strategy for extending one word path at a time without undue reliance on word path length. With multiple stack decoding, a stack is associated with each label of the label string. Word paths that most likely end at a given label are assigned to the stack corresponding to the given label and are ordered according to likelihood at the given label. The strategy of deciding which word path to extend includes the forming of a likelihood envelope against which the word paths are compared to determine if a word path is sufficiently likely to be extended. From among the word paths that are found to be extendible, the word path of highest likelihood in the earliest stack --i.e. the shortest most likely word path-- is selected for extension. After a word path is extended, it is deleted from its stack and the word paths extended therefrom are entered into appropriate stacks.
-
公开(公告)号:DE69230871D1
公开(公告)日:2000-05-11
申请号:DE69230871
申请日:1992-07-10
Applicant: IBM
-
公开(公告)号:CA2091912A1
公开(公告)日:1993-11-22
申请号:CA2091912
申请日:1993-03-18
Applicant: IBM
Inventor: BROWN PETER F , DELLA PIETRA STEPHEN A , DELLA PIETRA VINCENT J , JELINEK FREDERICK , MERCER ROBERT L
Abstract: A speech recognition system displays a source text of one or more words in a source language. The system has an acoustic processor for generating a sequence of coded representations of an utterance to be recognized. The utterance comprises a series of one or more words in a target language different from the source language. A set of one or more speech hypotheses, each comprising one or more words from the target language, are produced. Each speech hypothesis is modeled with an acoustic model. An acoustic match score for each speech hypothesis comprises an estimate of the closeness of a match between the acoustic model of the speech hypothesis and the sequence of coded representations of the utterance. A translation match score for each speech hypothesis comprises an estimate of the probability of occurrence of the speech hypothesis given the occurrence of the source text. A hypothesis score for each hypothesis comprises a combination of the acoustic match score and the translation match score. At least one word of one or more speech hypotheses having the best hypothesis scores is output as a recognition result.
-
公开(公告)号:CA2068780A1
公开(公告)日:1993-01-26
申请号:CA2068780
申请日:1992-05-15
Applicant: IBM
-
-
-
-
-
-
-
-