METHOD AND APPARATUS FOR SYNTHESIZING SPEECH WITHOUT VOICING OR PITCH INFORMATION

    Publication No.: CA1324833C

    Publication date: 1993-11-30

    Application No.: CA526482

    Application date: 1986-12-30

    Applicant: MOTOROLA INC

    Abstract: A channel bank speech synthesizer is disclosed for reconstructing speech from externally-generated acoustic feature information without using externally-generated voicing or pitch information. An N-channel pitch-excited channel bank synthesizer (340) is provided having a first low-frequency group of channel gain values (1 to M) and a second high-frequency group of channel gain values (M+1 to N). The first group controls a first group of amplitude modulators (950) excited by a periodic pitch pulse source (920), and the second group controls amplitude modulators excited by a noise source (930). Both groups of modulated excitation signals are applied to the bandpass filters (960) to reconstruct the speech channels, and then combined at the summation network (970) to form a reconstructed synthesized speech signal. Additionally, the pitch pulse source (920) varies the pitch pulse period such that the pitch pulse rate decreases over the length of the word.
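
The signal flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the sample rate, pitch periods, filter design (a standard second-order bandpass section), channel center frequencies, and all function names are assumptions chosen for illustration, and the channel gains are held constant over the segment for simplicity.

```python
import math
import random

def pitch_pulses(n, start_period, end_period):
    """Unit pulse train whose period grows from start_period to end_period
    samples, so the pulse rate falls over the length of the word."""
    out = [0.0] * n
    t = 0.0
    while t < n:
        out[int(t)] = 1.0
        t += start_period + (end_period - start_period) * (t / n)
    return out

def bandpass(x, f0, fs, q=5.0):
    """Second-order bandpass filter (assumed design, not from the abstract)."""
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    b0, b2 = alpha, -alpha
    a0, a1, a2 = 1.0 + alpha, -2.0 * math.cos(w0), 1.0 - alpha
    x1 = x2 = y1 = y2 = 0.0
    y = []
    for xn in x:
        yn = (b0 * xn + b2 * x2 - a1 * y1 - a2 * y2) / a0
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        y.append(yn)
    return y

def synthesize(gains, centers, m, n, fs=8000, seed=0):
    """Channels 1..M are excited by pitch pulses, channels M+1..N by noise;
    each excitation is scaled by its channel gain, bandpass filtered, summed."""
    rng = random.Random(seed)
    pulses = pitch_pulses(n, start_period=60, end_period=90)  # hypothetical periods
    noise = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    out = [0.0] * n
    for ch, (gain, f0) in enumerate(zip(gains, centers)):
        excitation = pulses if ch < m else noise
        modulated = [gain * s for s in excitation]      # amplitude modulator
        for i, s in enumerate(bandpass(modulated, f0, fs)):
            out[i] += s                                  # summation network
    return out
```

For example, `synthesize([1.0, 0.5, 0.2, 0.1], [300, 800, 1800, 3000], m=2, n=800)` produces 800 samples in which the two low channels are pulse-excited and the two high channels are noise-excited.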

    METHOD FOR ENTERING DIGIT SEQUENCES BY VOICE COMMAND

    Publication No.: CA1312668C

    Publication date: 1993-01-12

    Application No.: CA574731

    Application date: 1988-08-15

    Applicant: MOTOROLA INC

    Abstract: A user-interactive speech recognition control system is disclosed for recognizing a complete sequence of keywords (e.g., a telephone number such as 123-4567) via entering, verifying, and editing variable-length utterance strings (e.g., 1-2-3; 4-5; 6-7) separated by the user-defined placement of pauses. The device controller (120) utilizes timers (124) to monitor the pause time between partial-sequence digit strings recognized by the speech recognizer (110). When a string of digits is followed by a predetermined pause time interval, the recognized digits will be replied via the speech synthesizer (130). An additional string of digits can then be entered, and only the subsequent string will be replied after the next pause. Furthermore, the user has the flexibility to correct only the last digit string entered, or the entire sequence. Hence, if there is an error in only one digit, the erroneous digit string can be corrected without having to re-enter the entire digit sequence. The invention is well-suited to be used in a hands-free voice command dialing system for a mobile radiotelephone, wherein vehicular background noise may affect recognition accuracy.
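
The pause-delimited entry and correction behavior described above can be sketched as a small state machine. This is an illustrative sketch only: the class name, method names, and the 1.0-second pause threshold are hypothetical, since the abstract says only that the pause interval is "predetermined".

```python
PAUSE_THRESHOLD_S = 1.0  # hypothetical; the abstract does not give a value

class DigitEntry:
    """Collects recognized digits into pause-delimited partial strings."""

    def __init__(self):
        self.strings = []      # verified partial strings, in entry order
        self.current = []      # digits of the string still being spoken
        self.last_time = None  # time of the most recent recognized digit

    def _flush_if_paused(self, now):
        # A long enough pause closes the current string, which is returned
        # so the caller can reply it through the speech synthesizer.
        if self.current and now - self.last_time >= PAUSE_THRESHOLD_S:
            text = "".join(self.current)
            self.strings.append(text)
            self.current = []
            return text
        return None

    def on_digit(self, digit, now):
        replied = self._flush_if_paused(now)
        self.current.append(digit)
        self.last_time = now
        return replied

    def on_tick(self, now):
        """Timer callback: returns the string to reply, if a pause elapsed."""
        return self._flush_if_paused(now)

    def clear_last(self):
        """Correct only the last digit string entered."""
        if self.strings:
            self.strings.pop()

    def clear_all(self):
        """Discard the entire sequence."""
        self.strings = []
        self.current = []

    def sequence(self):
        return "".join(self.strings)
```

Entering "1", "2", "3", pausing, then "4", "5" and pausing again yields the replies "123" and "45"; calling `clear_last()` afterwards leaves "123" in place, so only the second string has to be spoken again.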

    CONTINUOUS SPEECH RECOGNITION SYSTEM

    Publication No.: CA1301340C

    Publication date: 1992-05-19

    Application No.: CA538501

    Application date: 1987-06-01

    Applicant: MOTOROLA INC

    Inventor: GERSON IRA A

    Abstract: A continuous speech recognition system employs a grammar tree of alternative potentially recognized word paths. A technique of tracing back through the grammar tree is utilized in determining which partial word path is common to all potential word paths. The common partial word path is deleted and words corresponding to the deleted partial word path are output as recognized words.
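
One way to picture the traceback step is with parent pointers: each active word path is traced back to the root, the nodes shared by every path are output as recognized words, and that shared portion is then detached from the tree. The sketch below makes several assumptions not stated in the abstract: the `Node` class, the function names, and the choice never to commit the final node of a path (which is still being scored) are all illustrative.

```python
class Node:
    """One word in the grammar tree, linked back to its predecessor."""
    def __init__(self, word, parent=None):
        self.word = word
        self.parent = parent

def path_to_root(node):
    """Trace back from a node to the root; return nodes in root-first order."""
    path = []
    while node is not None:
        path.append(node)
        node = node.parent
    path.reverse()
    return path

def emit_common_path(active_leaves):
    """Output the words common to all active paths and delete that partial path."""
    paths = [path_to_root(leaf) for leaf in active_leaves]
    common = []
    # Compare interior nodes only, so every path keeps at least its leaf.
    for nodes in zip(*(p[:-1] for p in paths)):
        if all(n is nodes[0] for n in nodes):
            common.append(nodes[0])
        else:
            break
    # Detach the committed prefix: the next node on each path becomes a root.
    for p in paths:
        p[len(common)].parent = None
    return [n.word for n in common]
```

With a tree "call" → "home" → {"now", "later"} and both leaves active, the call returns `["call", "home"]` and leaves "now" and "later" as parentless roots, so later tracebacks never revisit the words already output.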

    OPTICAL DECODING TECHNIQUE FOR SPEECH RECOGNITION

    Publication No.: CA1299752C

    Publication date: 1992-04-28

    Application No.: CA525298

    Application date: 1986-12-15

    Applicant: MOTOROLA INC

    Abstract: Described herein is an arrangement and method for processing speech information in a speech recognition system. The present invention is best employed in a system where the speech information is depicted as words, each word representing a sequence of frames, and where the recognition system has means for comparing present input speech to a word template that is stored in template memory and derived from one or more previous input words. The invention describes combining contiguous acoustically similar frames derived from the previous input word or words into representative frames to form a corresponding reduced word template, storing the reduced word template in template memory in an efficient manner, and comparing frames of the present input speech to the representative frames of the reduced word template according to the number of frames combined in the representative frames of the reduced word template. In doing so, a measure of similarity between the present input speech and the word template is generated.
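
The data reduction and count-weighted comparison can be sketched as follows. This is a simplified illustration under stated assumptions: frames are plain feature vectors, similarity is squared Euclidean distance against a hypothetical threshold, representative frames are running means, and the comparison uses a fixed linear alignment with an input of the same length rather than whatever alignment procedure the patent actually uses.

```python
def distance(a, b):
    """Squared Euclidean distance between two feature frames."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean_frame(run):
    """Component-wise mean of a run of frames."""
    return [sum(col) / len(run) for col in zip(*run)]

def reduce_template(frames, threshold):
    """Merge contiguous similar frames into (representative_frame, count) pairs."""
    if not frames:
        return []
    reduced = []
    run = [frames[0]]
    for frame in frames[1:]:
        if distance(frame, mean_frame(run)) <= threshold:
            run.append(frame)          # still acoustically similar: extend the run
        else:
            reduced.append((mean_frame(run), len(run)))
            run = [frame]
    reduced.append((mean_frame(run), len(run)))
    return reduced

def template_distance(input_frames, reduced):
    """Compare input frames to a reduced template; each representative frame is
    matched against as many input frames as it stands for (its count)."""
    total = 0.0
    i = 0
    for rep, count in reduced:
        for frame in input_frames[i:i + count]:
            total += distance(frame, rep)
        i += count
    return total
```

A five-frame template made of two identical frames followed by three identical frames reduces to two stored entries with counts 2 and 3, and the counts keep the comparison equivalent to matching against the full-length template.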

    METHOD FOR TERMINATING A TELEPHONE CALL BY VOICE COMMAND

    Publication No.: CA1290871C

    Publication date: 1991-10-15

    Application No.: CA574301

    Application date: 1988-08-10

    Applicant: MOTOROLA INC

    Abstract: A reliable method for terminating a telephone call is disclosed using a specific sequence of steps performed by the hands-free control system. The invention requires that the call terminating command sequence be recognized as: (1) two separate speech utterances (e.g., TERMINATE and CONVERSATION); (2) in proper sequence (e.g., TERMINATE first, then CONVERSATION); (3) with a maximum pause time interval between the end of the first utterance and the start of the second utterance (e.g., 300 milliseconds); and (4) which meet predefined speech recognition matching criteria. Moreover, the present invention provides the user with a means to continue the telephone call in progress should the speech recognizer falsely trigger, or if the user did not intend to speak the proper command. As a result, the present invention enables a user to disconnect a telephone call by voice command with a high degree of reliability, even under high ambient noise conditions.
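
The four conditions listed in the abstract can be expressed as a single check. The sketch below is illustrative only: the `Utterance` record, the function name, and the 0.8 score threshold are hypothetical, while the two command words and the 300 ms maximum pause are the examples given in the abstract.

```python
from collections import namedtuple

# Hypothetical record for one recognized utterance; times are in seconds and
# score is a recognizer match score where higher means a better match.
Utterance = namedtuple("Utterance", ["word", "start_s", "end_s", "score"])

MAX_PAUSE_S = 0.3       # example value from the abstract (300 milliseconds)
MATCH_THRESHOLD = 0.8   # hypothetical matching criterion

def is_terminate_command(first, second):
    """Return True only if all four conditions for call termination hold."""
    # (1) and (2): two separate utterances, in the proper order.
    if first.word != "TERMINATE" or second.word != "CONVERSATION":
        return False
    # (3): the pause between the utterances must not exceed the maximum.
    pause = second.start_s - first.end_s
    if pause < 0 or pause > MAX_PAUSE_S:
        return False
    # (4): both utterances must meet the recognition matching criteria.
    return first.score >= MATCH_THRESHOLD and second.score >= MATCH_THRESHOLD
```

Requiring every condition at once is what makes an accidental disconnect unlikely: a single misrecognized word, the wrong order, or an overlong pause each causes the check to fail and the call to continue.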
