-
公开(公告)号:AT513367T
公开(公告)日:2011-07-15
申请号:AT08727788
申请日:2008-01-16
Applicant: QUALCOMM INC
Inventor: LEE CHONG U , JULIAN DAVID JONATHAN , GARUDADRI HARINATH , MAJUMDAR SOMDEB
IPC: H03M3/02
Abstract: Apparatus and method for processing signals. A sigma-delta modulator is used. An adaptive dynamic range controller is configured to adaptively adjust the dynamic range of a signal output from the sigma-delta modulator.
-
公开(公告)号:DE60233763D1
公开(公告)日:2009-10-29
申请号:DE60233763
申请日:2002-03-22
Applicant: QUALCOMM INC
Inventor: MALAYATH NARENDRANATH , DEJACO ANDREW P , CHANG CHIENCHUNG , JALIL SUHAIL , NING BI , GARUDADRI HARINATH
Abstract: A voice recognition (VR) system is disclosed that utilizes a combination of speaker independent (SI) and speaker dependent (SD) acoustic models. At least one SI acoustic model is used in combination with at least one SD acoustic model to provide a level of speech recognition performance that at least equals that of a purely SI acoustic model. The disclosed hybrid SI/SD VR system continually uses unsupervised training to update the acoustic templates in the one or more SD acoustic models. The hybrid VR system then uses the updated SD acoustic models in combination with the at least one SI acoustic model to provide improved VR performance during VR testing
-
公开(公告)号:DE602004010081T2
公开(公告)日:2008-09-11
申请号:DE602004010081
申请日:2004-03-23
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , RAMCHANDRAN KANNAN
IPC: H04N7/26 , H03M13/29 , H04N7/66 , H04N19/895
Abstract: According to one aspect of the present invention, a method and apparatus is provided in which input data (e.g., input video data) is encoded in accordance with a first coding standard (e.g., MPEG-4) to generate encoded data. The input data is also encoded based on a reconstruction of the input data to generate encoded side information associated with the input data. The encoded data are transmitted to a destination (e.g., a decoding subsystem) over a first channel and the encoded side information are transmitted to the destination over a second channel. The encoded data and the encoded side information are decoded and combined at the destination to generate output data.
-
公开(公告)号:DE60036931T2
公开(公告)日:2008-08-07
申请号:DE60036931
申请日:2000-03-30
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , DEJACO ANDREW P
IPC: G10L15/00 , G10L15/22 , G10L15/06 , H04M1/00 , H04M1/23 , H04M1/26 , H04M1/27 , H04M1/56 , H04M1/57 , H04M3/493 , H04M11/00
Abstract: A spoken user interface for speech-enabled devices includes a processor and a set of software instructions that are executable by the processor and stored in nonvolatile memory. A user of the speech-enabled device is prompted to enter a voice tag associated with an entry in a call history of the speech-enabled device. The call history includes lists of incoming and outgoing email messages, and incoming and outgoing telephone calls. The user is prompted to enter a voice tag after associated with a telephone number or email address in the call history after a user-selected number of telephone calls has been sent from the speech-enabled device to that telephone number, or has been sent from the telephone with that telephone number to the speech-enabled device, or after a user-selected number of email messages has been sent from the speech-enabled device to that email address, or has been sent from that email address to the speech-enabled device. The user may populate a phonebook of the speech-enabled device with email addresses by sending an email message to the speech-enabled device from a computer and including additional email addresses in the To: field and/or the CC: field of the email message.
-
公开(公告)号:ES2295895T3
公开(公告)日:2008-04-16
申请号:ES04758095
申请日:2004-03-23
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , RAMCHANDRAN KANNAN
IPC: H04N7/26 , H03M13/29 , H04N7/66 , H04N19/895
Abstract: Un procedimiento que comprende: codificar datos de entrada de acuerdo con un primer estándar de codificación para generar datos codificados, codificar los datos de entrada basados en una reconstrucción de los datos de entrada a partir de los datos codificados para generar datos laterales codificados, y en el que la codificación de los datos de entrada basada en la reconstrucción de los datos de entrada comprende: clasificar bloques de los datos de entrada basados en su correlación con una reconstrucción de un fotograma actual de los datos de entrada, y transmitir los datos codificados por un primer canal y los datos laterales codificados por un segundo canal a un destino.
-
公开(公告)号:ES2288549T3
公开(公告)日:2008-01-16
申请号:ES02725288
申请日:2002-03-22
Applicant: QUALCOMM INC
Inventor: MALAYATH NARENDRANATH , DEJACO ANDREW P , CHANG CHIENCHUNG , JALIL SUHAIL , BI NING , GARUDADRI HARINATH
Abstract: Un aparato de reconocimiento de la voz que comprende: un modelo (230, 232) acústico independiente del hablante; un modelo (234) acústico dependiente del hablante que está confeccionado para un hablante; un motor (220) de reconocimiento de la voz; y un medio legible por un ordenador que almacena un conjunto de instrucciones para realizar la formación sin supervisión y la prueba de reconocimiento de voz, en el que el conjunto de instrucciones está adaptado para realizar una casación de patrones de la voz de entrada proveniente de dicho hablante con el contenido del mencionado modelo acústico independiente del hablante para producir referencias de casación de patrones independientes del hablante; para comparar las referencias de casación de patrones independientes del hablante con las referencias asociadas con las plantillas almacenadas en el mencionado modelo acústico dependiente del hablante; y si las referencias de casación de patrones independientes del hablante son más altos que las referencias asociadas con las plantillas almacenadas en el modelo acústico dependiente del hablante (234), almacenar una nueva plantilla en el mencionado modelo acústico dependiente del hablante en base a las referencias de casación de patrones independientes del hablante.
-
公开(公告)号:BRPI0510962A
公开(公告)日:2007-11-20
申请号:BRPI0510962
申请日:2005-05-13
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , SAGETONG PHOOM , NANDA SANJIV , LUNDBY STEIN A
IPC: H04L29/06 , H04B7/00 , H04B7/216 , H04L12/28 , H04L12/56 , H04L12/66 , H04N7/26 , H04W28/06 , H04W72/12 , H04W84/04 , H04W88/18
Abstract: Methods and apparatus are described for transmitting information units over a plurality of constant bit rate communication channel. The techniques include encoding the information units, thereby creating a plurality of data packets. The encoding is constrained such that the data packet sizes match physical layer packet sizes of the communication channel. The information units may include a variable bit rate data stream, multimedia data, video data, and audio data. The communication channels include CMDA channels, WCDMA, GSM channels, GPRS channels, and EDGE channels.
-
公开(公告)号:BR0309685A
公开(公告)日:2007-05-29
申请号:BR0309685
申请日:2003-04-29
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , SIVADAS SUNIL , HERMANSKY HYNEK , MORGAN NELSON H , WOOTERS CHARLES C , ADAMI ANDRE GUASTAVO , ORTUZAR MARIA CARMEN BENITEZ , BURGET LUKAS , DUPONT STEPHANE N , GREZL FRANTISEK , JAIN PRATIBHA , KAJAREKAR SACHIN , MOTLICEK PETR
Abstract: A distributed voice recognition system and method for obtaining acoustic features and speech activity at multiple frequencies by extracting high frequency components thereof on a device, such as a subscriber station and transmitting them to a network server having multiple stream processing capability, including cepstral feature processing, MLP nonlinear transformation processing, and multiband temporal pattern architecture processing. The features received at the network server are processed using all three streams, wherein each of the three streams provide benefits not available in the other two, thereby enhancing feature interpretation. Feature extraction and feature interpretation may operate at multiple frequencies, including but not limited to 8 kHz, 11 kHz, and 16 kHz.
-
39.
公开(公告)号:ES2273885T3
公开(公告)日:2007-05-16
申请号:ES01968568
申请日:2001-09-05
Applicant: QUALCOMM INC
Inventor: QI YINGYONG , BI NING , GARUDADRI HARINATH
Abstract: Un sistema de reconocimiento de voz (100), que comprende: una pluralidad de motores RV de reconocimiento de voz (104, 106, 108) con cada uno de los motores de reconocimiento de voz configurados para producir una palabra candidato; y un módulo de mapeo (110) que se configura para aceptar como entrada la palabra candidato de la pluralidad de motores de RV (104, 106, 108) y selecciona un candidato de palabra basado en una función de mapeo; Donde la función de mapeo es: En donde F es un primer motor de reconocimiento de voz, S es un segundo motor de reconocimiento de voz, F1wi es la distancia entre la pronunciación TU y la palabra candidato Wi, F2wi es la distancia para el segundo mejor candidato exluyendo Wi. Dg denota la distancia entre TU y la plantilla de desecho, S1wi es la distancia entre la pronunciación TU y Wi, S2wi es la distancia para el segundo mejor candidato excluyendo Wi, Sg denota la distancia entre TU y la plantilla de desecho, y ci = (i = 0, 1, ....n) es un coeficiente y el limite superior n es igual a la suma del número de motores RV más la suma de palabras candidato para cada motor RV.
-
公开(公告)号:DE60124408D1
公开(公告)日:2006-12-21
申请号:DE60124408
申请日:2001-09-05
Applicant: QUALCOMM INC
Inventor: QI YINGYONG , BI NING , GARUDADRI HARINATH
Abstract: A method and system that combines voice recognition engines and resolves differences between the results of individual voice recognition engines using a mapping function. Speaker independent voice recognition engines and speaker-dependent voice recognition engines are combined. Hidden Markov Model (HMM) engines and Dynamic Time Warping (DTW) engines are combined.
-
-
-
-
-
-
-
-
-