-
Publication No.: DE3853294T2
Publication date: 1995-10-12
Application No.: DE3853294
Filing date: 1988-08-24
Applicant: MOTOROLA INC
Inventor: GERSON IRA ALAN , LINDSLEY BRETT LOUIS
IPC: G10L15/28 , G10L15/00 , G10L15/22 , H04M1/00 , H04M1/27 , H04M1/60 , H04M9/08 , H04Q7/38 , H04B1/46 , G10L9/06
Abstract: A reliable method for terminating a telephone call is disclosed using a specific sequence of steps performed by a hands-free control system. The invention requires that the call terminating command sequence be recognized as: two separate speech utterances (e.g., TERMINATE (158) and CONVERSATION (158)); in proper sequence (e.g., TERMINATE first, then CONVERSATION); with a maximum pause time interval (124) between the end of the first utterance and the start of the second utterance (e.g., 300 milliseconds); and which meet predefined speech recognition matching criteria (110). Moreover, the present invention provides the user with a procedure to continue the telephone call in progress should the speech recognizer make a false recognition or if the user did not intend to speak the proper command. As a result, the present invention enables a user to disconnect a telephone call by voice command with a high degree of reliability, even under high ambient noise conditions.
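The acceptance conditions listed in the abstract above can be sketched as a single check. This is a minimal illustration, not the patented implementation: the `Utterance` record, the `score` field, and the `min_score` threshold are hypothetical stand-ins for the "predefined speech recognition matching criteria", and only the 300 millisecond pause comes from the abstract.

```python
from dataclasses import dataclass

MAX_PAUSE_S = 0.300  # example maximum pause from the abstract (300 milliseconds)


@dataclass
class Utterance:
    word: str     # keyword reported by the recognizer
    start: float  # start time in seconds
    end: float    # end time in seconds
    score: float  # hypothetical match score, higher is better


def is_terminate_command(first, second, min_score=0.8, max_pause=MAX_PAUSE_S):
    """Return True only if every condition named in the abstract holds."""
    # Two separate utterances in the proper sequence.
    in_order = first.word == "TERMINATE" and second.word == "CONVERSATION"
    # Pause measured from the end of the first word to the start of the second.
    pause = second.start - first.end
    pause_ok = 0.0 <= pause <= max_pause
    # Assumed matching criterion: both words must clear a score threshold.
    scores_ok = first.score >= min_score and second.score >= min_score
    return in_order and pause_ok and scores_ok
```

Requiring all three conditions together is what makes an accidental hang-up unlikely: a single misrecognized word, a long hesitation, or a weak match each causes the command to be rejected.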
-
Publication No.: DE3688614T2
Publication date: 1993-10-07
Application No.: DE3688614
Filing date: 1986-12-18
Applicant: MOTOROLA INC
Inventor: GERSON IRA ALAN , LINDSLEY BRETT LOUIS , SMANSKI PHILIP JEROME
IPC: G10L11/00 , A61K35/14 , A61K38/00 , A61P31/12 , A61P35/00 , C07K1/20 , C07K14/395 , C07K14/52 , C07K14/54 , C12N15/09 , C12P21/02 , C12R1/865 , G10L15/06 , G10L15/10 , G10L15/18 , G10L5/00
Abstract: Described herein is an arrangement and method for processing speech information in a speech recognition system (300). The invention is best employed in a system where the speech information is depicted as words, each word representing a sequence of frames (510), and where the recognition system has means (120) for comparing present input speech to a word template that is stored in template memory and derived from one or more previous input words. The invention describes combining contiguous acoustically similar frames (512) derived from the previous input word or words into representative frames to form a corresponding reduced word template, storing the reduced word template in template memory in an efficient manner, and comparing frames of the present input speech to the representative frames of the reduced word template according to the number of frames combined in the representative frames of the reduced word template. In doing so, a measure of similarity between the present input speech and the word template is generated.
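The frame-combining step in the abstract above can be illustrated with a short sketch. The abstract does not say how acoustic similarity is measured or how a representative frame is formed, so the Euclidean distance, the `threshold` parameter, the per-dimension mean, and the function names below are all assumptions made for illustration.

```python
import math


def distance(a, b):
    """Euclidean distance between two feature frames (assumed similarity measure)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean_frame(run):
    """Per-dimension average of a run of frames (assumed representative frame)."""
    return [sum(column) / len(run) for column in zip(*run)]


def reduce_template(frames, threshold):
    """Merge runs of contiguous similar frames into (representative, count) pairs."""
    if not frames:
        return []
    reduced = []
    run = [frames[0]]
    for frame in frames[1:]:
        # A frame joins the current run while it stays close to the run's first frame.
        if distance(frame, run[0]) <= threshold:
            run.append(frame)
        else:
            reduced.append((mean_frame(run), len(run)))
            run = [frame]
    reduced.append((mean_frame(run), len(run)))
    return reduced
```

Keeping the count next to each representative frame is what would let a matcher weight comparisons "according to the number of frames combined", as the abstract puts it; the matching step itself is not shown here.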
-
Publication No.: DE3688749D1
Publication date: 1993-08-26
Application No.: DE3688749
Filing date: 1986-12-22
Applicant: MOTOROLA INC
Inventor: BORTH DAVID EDWARD , GERSON IRA ALAN , VILMUR RICHARD JOSEPH , LINDSLEY BRETT LOUIS
-
Publication No.: CA2186627A1
Publication date: 1996-08-29
Application No.: CA2186627
Filing date: 1995-12-14
Applicant: MOTOROLA INC
Inventor: AUYEUNG CHEUNG , LINDSLEY BRETT LOUIS , LEVINE STEPHEN NORMAN
IPC: H04N7/32 , H04N7/50 , H04N21/234 , H04N21/44 , H04N7/26
Abstract: The present invention is a method and apparatus for preventing overflow and underflow of an encoder buffer in a video compression system. A virtual buffer is created in a rate controller to model the decoder buffer fullness (102). A sequence of bits is generated by an encoder (104). The encoder is controlled by the rate controller to prevent decoder buffer underflow and overflow. Then, the sequence of bits is received by the encoder buffer to produce a bitstream (106). The bitstream corresponds to an instantaneous channel bitrate. The bitstream is transmitted from the encoder buffer to a decoder buffer following a delay (108). The delay is controlled by the rate controller to synchronize the encoder buffer fullness with the virtual buffer fullness (110). The synchronization prevents overflow and underflow of the encoder buffer.
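The idea of tracking buffer fullness in a rate controller can be sketched with a generic leaky-bucket model. This is not the method claimed in the record above: the abstract gives no formulas, so the `VirtualBuffer` class, its fields, and the clamping rule are assumptions chosen only to show how a modeled fullness can bound the bits spent on the next frame.

```python
class VirtualBuffer:
    """Generic leaky-bucket model of buffer fullness, in bits (illustrative only)."""

    def __init__(self, size_bits, fullness_bits=0):
        self.size = size_bits
        self.fullness = fullness_bits

    def allowed_range(self, drain_bits):
        """Frame sizes that keep fullness within [0, size] after one frame period."""
        lowest = max(0, drain_bits - self.fullness)       # fewer bits would underflow
        highest = self.size - self.fullness + drain_bits  # more bits would overflow
        return lowest, highest

    def update(self, frame_bits, drain_bits):
        """Add the encoded frame and remove what the channel carried away."""
        self.fullness += frame_bits - drain_bits


def clamp_frame_bits(requested_bits, buffer, drain_bits):
    """Limit the encoder's requested frame size to the safe range."""
    lowest, highest = buffer.allowed_range(drain_bits)
    return min(max(requested_bits, lowest), highest)
```

For example, with a 1000-bit buffer holding 900 bits and a channel that drains 100 bits per frame period, a request for 500 bits is cut to 200, which fills the buffer exactly without overflowing it.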
-
Publication No.: AT119724T
Publication date: 1995-03-15
Application No.: AT88908527
Filing date: 1988-08-24
Applicant: MOTOROLA INC
Inventor: GERSON IRA ALAN , LINDSLEY BRETT LOUIS
IPC: G10L15/28 , G10L15/00 , G10L15/22 , H04M1/00 , H04M1/27 , H04M1/60 , H04M9/08 , H04Q7/38 , H04B1/46 , G10L9/06
Abstract: A reliable method for terminating a telephone call is disclosed using a specific sequence of steps performed by a hands-free control system. The invention requires that the call terminating command sequence be recognized as: two separate speech utterances (e.g., TERMINATE (158) and CONVERSATION (158)); in proper sequence (e.g., TERMINATE first, then CONVERSATION); with a maximum pause time interval (124) between the end of the first utterance and the start of the second utterance (e.g., 300 milliseconds); and which meet predefined speech recognition matching criteria (110). Moreover, the present invention provides the user with a procedure to continue the telephone call in progress should the speech recognizer make a false recognition or if the user did not intend to speak the proper command. As a result, the present invention enables a user to disconnect a telephone call by voice command with a high degree of reliability, even under high ambient noise conditions.
-
Publication No.: MX165502B
Publication date: 1992-11-16
Application No.: MX1336888
Filing date: 1988-10-11
Applicant: MOTOROLA INC
Inventor: GERSON IRA ALAN , LINDSLEY BRETT LOUIS
Abstract: A user-interactive speech recognition control system is disclosed for recognizing a complete sequence of keywords (e.g., a telephone number such as 123-4567) via entering, verifying, and editing variable-length utterance strings (e.g., 1-2-3; 4-5; 6-7) separated by the user-defined placement of pauses. The device controller (120) utilizes timers (124) to monitor the pause time between partial-sequence digit strings recognized by the speech recognizer (110). When a string of digits is followed by a predetermined pause time interval, the recognized digits will be replied via the speech synthesizer (130). An additional string of digits can then be entered, and only the subsequent string will be replied after the next pause. Furthermore, the user has the flexibility to correct only the last digit string entered, or the entire sequence. Hence, if there is an error in only one digit, the erroneous digit string can be corrected without having to re-enter the entire digit sequence. The invention is well-suited to be used in a hands-free voice command dialing system for a mobile radiotelephone, wherein vehicular background noise may affect recognition accuracy.
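The pause-delimited entry and correction flow in the abstract above can be sketched as follows. The sketch is hypothetical throughout: the `(digit, start, end)` tuple layout, the `pause_s` threshold, and the `DigitEntry` class are illustrative names, and the speech synthesizer reply is reduced to a returned string.

```python
def split_on_pauses(timed_digits, pause_s):
    """Group (digit, start, end) tuples into strings wherever a long pause occurs."""
    groups = []
    current = ""
    last_end = None
    for digit, start, end in timed_digits:
        # A gap of at least pause_s closes the current partial string.
        if last_end is not None and start - last_end >= pause_s:
            groups.append(current)
            current = ""
        current += digit
        last_end = end
    if current:
        groups.append(current)
    return groups


class DigitEntry:
    """Holds the partial digit strings entered so far."""

    def __init__(self):
        self.strings = []

    def add_string(self, digits):
        """Store one partial string and return it, standing in for the spoken reply."""
        self.strings.append(digits)
        return digits

    def correct_last(self):
        """Discard only the most recent partial string."""
        if self.strings:
            self.strings.pop()

    def clear_all(self):
        """Discard the entire sequence."""
        self.strings = []

    def number(self):
        return "".join(self.strings)
```

Storing the partial strings separately, rather than one flat digit list, is what makes it cheap to redo only the last group: after entering 123, 45, 67, a call to `correct_last()` followed by `add_string("68")` yields 1234568 without re-entering the first five digits.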
-
Publication No.: AU609527B2
Publication date: 1991-05-02
Application No.: AU2382688
Filing date: 1988-08-24
Applicant: MOTOROLA INC
Inventor: GERSON IRA ALAN , LINDSLEY BRETT LOUIS
IPC: G10L15/28 , G10L15/00 , G10L15/22 , H04M1/00 , H04M1/27 , H04M1/60 , H04M9/08 , H04Q7/38 , H04B1/46 , G10L9/06 , H04Q7/04
Abstract: A reliable method for terminating a telephone call is disclosed using a specific sequence of steps performed by a hands-free control system. The invention requires that the call terminating command sequence be recognized as: two separate speech utterances (e.g., TERMINATE (158) and CONVERSATION (158)); in proper sequence (e.g., TERMINATE first, then CONVERSATION); with a maximum pause time interval (124) between the end of the first utterance and the start of the second utterance (e.g., 300 milliseconds); and which meet predefined speech recognition matching criteria (110). Moreover, the present invention provides the user with a procedure to continue the telephone call in progress should the speech recognizer make a false recognition or if the user did not intend to speak the proper command. As a result, the present invention enables a user to disconnect a telephone call by voice command with a high degree of reliability, even under high ambient noise conditions.
-
Publication No.: AU678926B2
Publication date: 1997-06-12
Application No.: AU4520096
Filing date: 1995-12-14
Applicant: MOTOROLA INC
Inventor: AUYEUNG CHEUNG , LINDSLEY BRETT LOUIS , LEVINE STEPHEN NORMAN
Abstract: The present invention is a method and apparatus for preventing overflow and underflow of an encoder buffer in a video compression system. A virtual buffer is created in a rate controller to model the decoder buffer fullness (102). A sequence of bits is generated by an encoder (104). The encoder is controlled by the rate controller to prevent decoder buffer underflow and overflow. Then, the sequence of bits is received by the encoder buffer to produce a bitstream (106). The bitstream corresponds to an instantaneous channel bitrate. The bitstream is transmitted from the encoder buffer to a decoder buffer following a delay (108). The delay is controlled by the rate controller to synchronize the encoder buffer fullness with the virtual buffer fullness (110). The synchronization prevents overflow and underflow of the encoder buffer.
-
Publication No.: AT136146T
Publication date: 1996-04-15
Application No.: AT88909654
Filing date: 1988-08-24
Applicant: MOTOROLA INC
Inventor: GERSON IRA ALAN , LINDSLEY BRETT LOUIS
Abstract: A user-interactive speech recognition control system is disclosed for recognizing a complete sequence of keywords (e.g., a telephone number such as 123-4567) via entering, verifying, and editing variable-length utterance strings (e.g., 1-2-3; 4-5; 6-7) separated by the user-defined placement of pauses. The device controller (120) utilizes timers (124) to monitor the pause time between partial-sequence digit strings recognized by the speech recognizer (110). When a string of digits is followed by a predetermined pause time interval, the recognized digits will be replied via the speech synthesizer (130). An additional string of digits can then be entered, and only the subsequent string will be replied after the next pause. Furthermore, the user has the flexibility to correct only the last digit string entered, or the entire sequence. Hence, if there is an error in only one digit, the erroneous digit string can be corrected without having to re-enter the entire digit sequence. The invention is well-suited to be used in a hands-free voice command dialing system for a mobile radiotelephone, wherein vehicular background noise may affect recognition accuracy.
-
Publication No.: HK40496A
Publication date: 1996-03-15
Application No.: HK40496
Filing date: 1996-03-07
Applicant: MOTOROLA INC
Inventor: GERSON IRA ALAN , LINDSLEY BRETT LOUIS , SMANSKI PHILIP JEROME
IPC: G10L11/00 , A61K35/14 , A61K38/00 , A61P31/12 , A61P35/00 , C07K1/20 , C07K14/395 , C07K14/52 , C07K14/54 , C12N15/09 , C12P21/02 , C12R1/865 , G10L15/06 , G10L15/10 , G10L15/18 , G10L5/00
Abstract: Described herein is an arrangement and method for processing speech information in a speech recognition system (300). The invention is best employed in a system where the speech information is depicted as words, each word representing a sequence of frames (510), and where the recognition system has means (120) for comparing present input speech to a word template that is stored in template memory and derived from one or more previous input words. The invention describes combining contiguous acoustically similar frames (512) derived from the previous input word or words into representative frames to form a corresponding reduced word template, storing the reduced word template in template memory in an efficient manner, and comparing frames of the present input speech to the representative frames of the reduced word template according to the number of frames combined in the representative frames of the reduced word template. In doing so, a measure of similarity between the present input speech and the word template is generated.