-
111.
公开(公告)号:AU2004223383A1
公开(公告)日:2004-10-07
申请号:AU2004223383
申请日:2004-03-23
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , RAMCHANDRAN KANNAN
IPC: H03M13/29 , H04N7/26 , H04N7/66 , H04N19/895 , H03M13/00
Abstract: According to one aspect of the present invention, a method and apparatus is provided in which input data (e.g., input video data) is encoded in accordance with a first coding standard (e.g., MPEG-4) to generate encoded data. The input data is also encoded based on a reconstruction of the input data to generate encoded side information associated with the input data. The encoded data are transmitted to a destination (e.g., a decoding subsystem) over a first channel and the encoded side information are transmitted to the destination over a second channel. The encoded data and the encoded side information are decoded and combined at the destination to generate output data.
-
公开(公告)号:BR0206413A
公开(公告)日:2004-06-22
申请号:BR0206413
申请日:2002-01-10
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH
Abstract: A method improves voice recognition by improving storage of voice recognition (VR) templates. The improved storage means that more VR models can be stored in memory. The more VR models that are stored in memory, the more robust the VR system, and therefore the more accurate the VR system. Lossy compression techniques are used to compress VR models. In one embodiment, Mu-law compression and A-law expansion are used to compress and expand VR models. In another embodiment, Mu-law compression and Mu-law expansion are used to compress and expand VR models. VR models are compressed during a training process, and they are expanded during voice recognition.
-
113.
公开(公告)号:HK1058428A1
公开(公告)日:2004-05-14
申请号:HK04101178
申请日:2004-02-19
Applicant: QUALCOMM INC
Inventor: QI YINGYONG , BI NING , GARUDADRI HARINATH
Abstract: A method and system that combines voice recognition engines and resolves differences between the results of individual voice recognition engines using a mapping function. Speaker independent voice recognition engines and speaker-dependent voice recognition engines are combined. Hidden Markov Model (HMM) engines and Dynamic Time Warping (DTW) engines are combined.
-
114.
公开(公告)号:CA2446936A1
公开(公告)日:2002-11-21
申请号:CA2446936
申请日:2002-05-17
Applicant: QUALCOMM INC , SPEECHWORKS INT INC
Inventor: GARUDADRI HARINATH , PHILLIPS MICHAEL STUART
Abstract: A system and method for transmitting speech activity in a distributed voice recognition system. The distributed voice recognition system includes a loca l VR engine in a subscriber unit and a server VR engine on a server. The local VR engine comprises a feature extraction (FE) module that extracts features from a speech signal, and a voice activity detection module (VAD) that detec ts voice activity within a speech signal. Indications of voice activity are transmitted ahead of features from the subscriber unit to the server.
-
公开(公告)号:AU2002255863A1
公开(公告)日:2002-10-15
申请号:AU2002255863
申请日:2002-03-22
Applicant: QUALCOMM INC
Inventor: DEJACO ANDREW P , GARUDADRI HARINATH , BI NING , MALAYATH NARENDRANATH , CHANG CHIENCHUNG , JALIL SUHAIL
Abstract: A voice recognition (VR) system is disclosed that utilizes a combination of speaker independent (SI) and speaker dependent (SD) acoustic models. At least one SI acoustic model is used in combination with at least one SD acoustic model to provide a level of speech recognition performance that at least equals that of a purely SI acoustic model. The disclosed hybrid SI/SD VR system continually uses unsupervised training to update the acoustic templates in the one or more SD acoustic models. The hybrid VR system then uses the updated SD acoustic models in combination with the at least one SI acoustic model to provide improved VR performance during VR testing
-
公开(公告)号:HK1043424A1
公开(公告)日:2002-09-13
申请号:HK02105127
申请日:2002-07-10
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , DEJACO ANDREW P
IPC: G10L15/00 , H04M20060101 , G10L20060101 , G10L15/06 , G10L15/22 , H04M1/00 , H04M1/23 , H04M1/26 , H04M1/27 , H04M1/56 , H04M1/57 , H04M3/493 , H04M11/00
Abstract: A spoken user interface for speech-enabled devices includes a processor and a set of software instructions that are executable by the processor and stored in nonvolatile memory. A user of the speech-enabled device is prompted to enter a voice tag associated with an entry in a call history of the speech-enabled device. The call history includes lists of incoming and outgoing email messages, and incoming and outgoing telephone calls. The user is prompted to enter a voice tag after associated with a telephone number or email address in the call history after a user-selected number of telephone calls has been sent from the speech-enabled device to that telephone number, or has been sent from the telephone with that telephone number to the speech-enabled device, or after a user-selected number of email messages has been sent from the speech-enabled device to that email address, or has been sent from that email address to the speech-enabled device. The user may populate a phonebook of the speech-enabled device with email addresses by sending an email message to the speech-enabled device from a computer and including additional email addresses in the To: field and/or the CC: field of the email message.
-
公开(公告)号:AU3074002A
公开(公告)日:2002-07-01
申请号:AU3074002
申请日:2001-12-13
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , DEJACO ANDREW P , CHANG CHIENCHUNG
Abstract: A novel and improved method and an accompanying apparatus provide for a distributed voice recognition (VR) capability in a remote device (201). Remote device (201) decides and controls what portions of the VR processing may take place at remote device (201) and what other portions may take place at a base station (202) in wireless communication with remote device (201).
-
公开(公告)号:AU4372400A
公开(公告)日:2000-10-16
申请号:AU4372400
申请日:2000-03-30
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , DEJACO ANDREW P
IPC: G10L15/00 , G10L15/06 , G10L15/22 , H04M1/00 , H04M1/23 , H04M1/26 , H04M1/27 , H04M1/56 , H04M1/57 , H04M3/493 , H04M11/00
Abstract: A spoken user interface for speech-enabled devices includes a processor and a set of software instructions that are executable by the processor and stored in nonvolatile memory. A user of the speech-enabled device is prompted to enter a voice tag associated with an entry in a call history of the speech-enabled device. The call history includes lists of incoming and outgoing email messages, and incoming and outgoing telephone calls. The user is prompted to enter a voice tag after associated with a telephone number or email address in the call history after a user-selected number of telephone calls has been sent from the speech-enabled device to that telephone number, or has been sent from the telephone with that telephone number to the speech-enabled device, or after a user-selected number of email messages has been sent from the speech-enabled device to that email address, or has been sent from that email address to the speech-enabled device. The user may populate a phonebook of the speech-enabled device with email addresses by sending an email message to the speech-enabled device from a computer and including additional email addresses in the To: field and/or the CC: field of the email message.
-
公开(公告)号:AU3589500A
公开(公告)日:2000-08-25
申请号:AU3589500
申请日:2000-02-04
Applicant: QUALCOMM INC
Inventor: DEJACO ANDREW P , WALTERS RICHARD P , GARUDADRI HARINATH
Abstract: An apparatus for testing user interface integrity of speech-enabled devices includes a processor and a storage medium coupled to the processor. A set of voiced utterances is stored in the storage medium. A software module is executed by the processor to determine a state of the voice recognizer and provide a response to the voice recognizer in accordance with the determined state. The response may be to produce at least one voiced utterance in accordance with the state. The apparatus may be acoustically coupled to the voice recognizer. The apparatus may also, or in the alternative, be electrically coupled by a cable to the voice recognizer. The set of voiced utterances may include multiple sets of voiced utterances, each set having been spoken by a different person. The set of voiced utterances may also, or in the alternative, include multiple sets of voiced utterances, each set of voiced utterances having been spoken under different background noise conditions. The software module may also be executable to monitor the performance of the voice recognizer.
-
公开(公告)号:AU3589300A
公开(公告)日:2000-08-25
申请号:AU3589300
申请日:2000-02-04
Applicant: QUALCOMM INC
Inventor: BI NING , CHANG CHIENCHUNG , GARUDADRI HARINATH , DEJACO ANDREW P
Abstract: A voice recognition rejection scheme for capturing an utterance includes the steps accepting the utterance, applying an N-best algorithm to the utterance, or rejecting the utterance. The utterance is accepted if a first predefined relationship exists between one or more closest comparison results for the utterance with respect to a stored word and one or more differences between the one or more closest comparison results and one or more other comparison results between the utterance and one or more other stored words. An N-best algorithm is applied to the utterance if a second predefined relationship exists between the one or more closest comparison results and the one or more differences between the one or more closest comparison results and the one or more other comparison results. The utterance is rejected if a third predefined relationship exists between the one or more closest comparison results and the one or more differences between the one or more closest comparison results and the one or more other comparison results. One of the one or more other comparison results may advantageously be a next-closest comparison result for the utterance and another store word. The first, second, and third predefined relationships may advantageously be linear relationships.
-
-
-
-
-
-
-
-
-