-
公开(公告)号:BR0309685A
公开(公告)日:2007-05-29
申请号:BR0309685
申请日:2003-04-29
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , SIVADAS SUNIL , HERMANSKY HYNEK , MORGAN NELSON H , WOOTERS CHARLES C , ADAMI ANDRE GUASTAVO , ORTUZAR MARIA CARMEN BENITEZ , BURGET LUKAS , DUPONT STEPHANE N , GREZL FRANTISEK , JAIN PRATIBHA , KAJAREKAR SACHIN , MOTLICEK PETR
Abstract: A distributed voice recognition system and method for obtaining acoustic features and speech activity at multiple frequencies by extracting high frequency components thereof on a device, such as a subscriber station and transmitting them to a network server having multiple stream processing capability, including cepstral feature processing, MLP nonlinear transformation processing, and multiband temporal pattern architecture processing. The features received at the network server are processed using all three streams, wherein each of the three streams provide benefits not available in the other two, thereby enhancing feature interpretation. Feature extraction and feature interpretation may operate at multiple frequencies, including but not limited to 8 kHz, 11 kHz, and 16 kHz.
-
公开(公告)号:AU2003225235A1
公开(公告)日:2003-11-17
申请号:AU2003225235
申请日:2003-04-29
Applicant: QUALCOMM INC
Inventor: GARUDADRI HARINATH , SIVADAS SUNIL , HERMANSKY HYNEK , MORGAN NELSON H , WOOTERS CHARLES C , ADAMI ANDRE GUSTAVO , ORTUZAR MARIA CARMEN BENITEZ , BURGET LUKAS , DUPONT STEPHANE N , GREZL FRANTISEK , JAIN PRATIBHA , KAJAREKAR SACHIN , MOTLICEK PETR
Abstract: A distributed voice recognition system and method for obtaining acoustic features and speech activity at multiple frequencies by extracting high frequency components thereof on a device, such as a subscriber station and transmitting them to a network server having multiple stream processing capability, including cepstral feature processing, MLP nonlinear transformation processing, and multiband temporal pattern architecture processing. The features received at the network server are processed using all three streams, wherein each of the three streams provide benefits not available in the other two, thereby enhancing feature interpretation. Feature extraction and feature interpretation may operate at multiple frequencies, including but not limited to 8 kHz, 11 kHz, and 16 kHz.
-