Abstract:
PROBLEM TO BE SOLVED: To support advanced applications by combining images captured by two image sensors.SOLUTION: A mobile device 100 includes a first sensor 102, a second sensor 104, a sensor position controller 106, and a camera process pipeline 108. The pipeline implements techniques such as demosaicking, lens roll-off correction, scaling, color correction, color conversion, and spatial filtering. The sensor position controller adjusts locations and/or positions of the two sensors by rotating, shifting or sliding the sensors 102, 104. The sensors 102, 104 enable features such as image quality improvement, 3-D image and video visualization, and 360-degree panoramic video generation.
Abstract:
A frame erasure compensation method in a variable-rate speech coder includes quantizing, with a first encoder, a pitch lag value for a current frame and a first delta pitch lag value equal to the difference between the pitch lag value for the current frame and the pitch lag value for the previous frame. A second, predictive encoder quantizes only a second delta pitch lag value for the previous frame (equal to the difference between the pitch lag value for the previous frame and the pitch lag value for the frame prior to that frame). If the frame prior to the previous frame is processed as a frame erasure, the pitch lag value for the previous frame is obtained by subtracting the first delta pitch lag value from the pitch lag value for the current frame. The pitch lag value for the erasure frame is then obtained by subtracting the second delta pitch lag value from the pitch lag value for the previous frame. Additionally, a waveform interpolation method may be used to smooth discontinuities caused by changes in the coder pitch memory.
Abstract:
Methods and apparatus for quickly selecting an optimal excitation waveform from a codebook are presented herein. To reduce the number of computations required to choose the optimal codebook vector, a subset of codevectors are selected based upon optimal pulse locations, wherein the subset of codevectors form a subcodebook. Rather than searching the entire codebook, only the entries of the subcodebook are searched.
Abstract:
Methods and apparatus for quickly selecting an optimal excitation waveform from a codebook are presented herein. To reduce the number of computations required to choose the optimal codebook vector, a subset of codevectors are selected based upon optimal pulse locations, wherein the subset of codevectors form a subcodebook. Rather than searching the entire codebook, only the entries of the subcodebook are searched.
Abstract:
A method and apparatus for encoding a prototype waveform is disclosed comprising performing (614) a cross-correlation between a phase spectra of the prototype waveform and a phase spectra of a reference prototype waveform; generating (614) representatives for the maximum values of the cross-correlation; and quantizing (612, 616) an amplitude vector of the prototype waveform and the representatives; whereupon the amplitude vector and the representatives are transmitted as the encoded form of the prototype waveform. Also disclosed is a method and apparatus for reconstructing a prototype waveform, comprising generating (716) linear phase shift values from received phase parameters; composing (714) a modified phase vector from reference phases and the linear phase shift values; and generating (708, 704) a reconstructed current prototype from the modified phase vector and received amplitude parameters.
Abstract:
Un procedimiento para codificar una forma de onda prototipo, que comprende: la realización (614) de una correlación cruzada entre los espectros de fase de la forma de onda prototipo y los espectros de fase de una forma de onda prototipo de referencia; la generación (614) de representantes de los espectros de fase para los valores máximos de la correlación cruzada; y la cuantificación (612, 616) de un vector de amplitud de la forma de onda prototipo y de los representantes; donde el vector de amplitud y los representantes se transmiten como el formato codificado de la forma de onda prototipo.
Abstract:
A frame erasure compensation method in a variable-rate speech coder includes quantizing, with a first encoder, a pitch lag value for a current frame and a first delta pitch lag value equal to the difference between the pitch lag value for the current frame and the pitch lag value for the previous frame. A second, predictive encoder quantizes only a second delta pitch lag value for the previous frame (equal to the difference between the pitch lag value for the previous frame and the pitch lag value for the frame prior to that frame). If the frame prior to the previous frame is processed as a frame erasure, the pitch lag value for the previous frame is obtained by subtracting the first delta pitch lag value from the pitch lag value for the current frame. The pitch lag value for the erasure frame is then obtained by subtracting the second delta pitch lag value from the pitch lag value for the previous frame. Additionally, a waveform interpolation method may be used to smooth discontinuities caused by changes in the coder pitch memory.
Abstract:
Un aparato para generar (500) una trama de salida de cifrador de habla, comprendiendo dicho aparato: medios para extraer componentes de retardo de tono, componentes de amplitud, componentes de fase y componentes de información espectral de línea a partir de una pluralidad de tramas de habla sonora; medios para obtener vectores de error objetivo, según un esquema de cuantificación predictiva, para las componentes de retardo de tono, componentes de amplitud, componentes de fase y componentes de información espectral lineal; medios para cuantificar el vector de error objetivo de las componentes de retardo de tono, el vector de error objetivo de las componentes de amplitud, el vector de error objetivo de las componentes de fase, y el vector de error objetivo de las componentes de información espectral de línea; medios para combinar los vectores de error objetivo cuantificados de las componentes de retardo de tono, componentes de amplitud, componentes de fase, y componentes de información espectral de línea para formar la trama de salida de cifrador de habla.
Abstract:
A method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions includes a speech coder configured to select from among various predictive coding modes. After a predefined number of speech frames have been predictively coded, the speech coder codes one frame with a nonpredictive coding mode or a mildly predictive coding mode. The predefined number of frames can be determined in advance from the subjective standpoint of a listener. The predefined number of frames may be varied periodically. An average coding bit rate may be maintained for the speech coder by ensuring that an average coding bit rate is maintained for each successive pattern, or group, of predictively coded speech frames including at least one nonpredictively coded or mildly predictively coded speech frame.