Abstract:
A method and system that combines voice recognition engines and resolves differences between the results of individual voice recognition engines using a mapping function. Speaker independent voice recognition engines and speaker-dependent voice recognition engines are combined. Hidden Markov Model (HMM) engines and Dynamic Time Warping (DTW) engines are combined.
Abstract:
A method for processing digitized speech signals by analyzing redundant features to provide more robust voice recognition. A primary transformation is applied to a source speech signal to extract primary features therefrom. Each of at least one secondary transformation is applied to the source speech signal or extracted primary features to yield at least one set of secondary features statistically dependant on the primary features. At least one predetermined function is then applied to combine the primary features with the secondary features. A recognition answer is generated by pattern matching this combination against predetermined voice recognition templates.
Abstract:
A method and system that improves voice recognition in a distributed voice recognition system. A distributed voice recognition system includes a local VR engine in a subscriber unit and a server VR engine on a server. When the local VR engine does not recognize a speech segment, the local VR engine sends information of the speech segment to the server VR engine If the speech segment is recognized by the server VR engine, then the server VR engine downloads information corresponding the speech segment to the local VR engine. The local VR engine may combine its speech segement information with downloaded information to create resultant information for a speech segment. The local VR engine may also apply a function to downloaded information to create resultant information. Resultant information then may be uploaded from the local VR engine to the server VR engine.
Abstract:
A method and system that combines voice recognition engines (104, 108, 112, 114) and resolves differences between the results of individual voice recognition engines (104, 106, 108, 112, 114) using a mapping function. Speaker independent voice recognition engine (104) and speaker-dependent voice recognition engine (106) are combined. Hidden Markov Model (HMM) engines (108, 114) and Dynamic Time Warping (DTW) engines (104, 106, 112) are combined.
Abstract:
A method, apparatus, and system for providing distributed source coding techniques that improve data coding performance, such as video data coding, when channel errors or losses occur. Errors in the reconstruction of the data is eliminated or reduced by sending extra information. Correlation between a predicted sequence and an original sequence can be used to design codebooks and find the cosets required to represent the original image. This information may be sent over another channel, or a secondary channel.
Abstract:
Methods and apparatus are described for improving the transmission of information over wireless communication channels. These techniques include determining available communication channels for transmitting information and determining possible physical layer packet sizes of the available channels. An information unit is partitioned into portions wherein the size of the portions are selected so as to match one of the physical layer packet sizes of the available communication channels. Another aspect is partitioning the information into a number of slices that correspond to the number of transmissions that occur during the information unit interval and assigning each partition to a corresponding transmission. The techniques can be used for various types of information, such as multimedia data, variable bit rate data streams, video data, or audio data. The techniques can also be used with various over the air interfaces, such as, Global System for Mobile Communication (GSM), General Packet Radio Service (GPRS), Enhanced Data GSM Environment (EDGE), or standards based on CDMA such as TIA/EIA-95-B (IS- 95), TIA/EIA-98-C (IS-98), IS2000, HRPD, cdma2000, Wideband CDMA (WCDMA), and others.
Abstract:
Techniques are described to reduce rounding errors during computation of discrete cosine transform using fixed-point calculations. According to these techniques, a discrete cosine transform a matrix of scaled coefficients is calculated by multiplying coefficients in a matrix of coefficients by scale factors. Next, a midpoint bias value and a supplemental bias value are added to a DC coefficient of the matrix of scaled coefficients. Next, an inverse discrete cosine transform is applied to the resulting matrix of scaled coefficients. Values in the resulting matrix are then right-shifted in order to derive a matrix of pixel component values. As described herein, the addition of the supplemental bias value to the DC coefficient reduces rounding errors attributable to this right-shifting. As a result, a final version of a digital media file decompressed using these techniques may more closely resemble an original version of a digital media file.
Abstract:
Methods and apparatus are described for transmitting information units over a plurality of constant bit rate communication channel. The techniques include encoding the information units, thereby creating a plurality of data packets. The encoding is constrained such that the data packet sizes match physical layer packet sizes of the communication channel. The information units may include a variable bit rate data stream, multimedia data, video data, and audio data. The communication channels include CMDA channels, WCDMA, GSM channels, GPRS channels, and EDGE channels.
Abstract:
Un procedimiento para transmitir información de capa de aplicación en un sistema de comunicación inalámbrica (100), comprendiendo el procedimiento: determinar (2004) tamaños posibles de paquete de capa física de una pluralidad de canales disponibles de comunicación de tasa de bits constante; establecer limitaciones para particionar unidades de información de capa de aplicación de manera que las particiones se dimensionen para que el tamaño de cada partición coincida con uno de los tamaños de paquete de la capa física determinados disponibles en la pluralidad de canales de comunicación disponibles inalámbrica de tasa de bits constante, caracterizado porque esta etapa comprende particionar una unidad de información de capa de aplicación en múltiples particiones descodificables de manera independiente, de manera que exista una correspondencia de uno a uno entre las unidades de información de capa de aplicación particionadas y los paquetes de la capa física comunicados por los canales de comunicación disponibles.