Abstract:
Disclosed are an encoding device and others capable of reducing the encoded information amount, the audio signal encoding error, and the decoded signal audio quality degradation. The device includes: a frequency region conversion unit (101) which converts an inputted audio signal into a frequency region; a band selection unit (102) which selects a quantization object band from a plurality of sub bands obtained by dividing the frequency region; and a shape quantization unit (103) which quantizes the shape of the frequency region parameter of the quantization object band. When a prediction encoding presence/absence judgment unit (104) judges that the number of common sub bands between the quantization object band and the quantization object band selected in the past is not smaller than a predetermined value, a gain quantization unit (105) performs prediction encoding on the gain of the frequency region parameter of the quantization object band. When the number of the common sub bands is smaller than the predetermined value, the gain quantization unit (105) directly quantizes the gain of the frequency region parameter of the quantization object band.
Abstract:
Se proporciona un dispositivo de codificación que puede obtener una calidad de sonido preferible para sentido auditivo incluso si el número de bits de información es pequeño. El dispositivo de codificación incluye una unidad de cuantificación de forma (111) que tiene: una unidad de búsqueda de sección (121) la cual busca un pulso para cada una de las bandas en las cuales está dividida una sección de búsqueda predeterminada, y una unidad de búsqueda completa (122) que lleva a cabo búsqueda de un pulso sobre la sección de búsqueda completa. La forma de un espectro de entrada es cuantificada por un número pequeño de posiciones de pulso y polaridades. Una unidad de cuantificación de ganancia (112) calcula una ganancia del pulso buscado por la unidad de cuantificación de forma (111) y cuantifica la ganancia para cada una de las bandas.
Abstract:
Provided is a voice encoding device which can accurately encode a spectrum shape of a signal having a strong tonality such as a vowel. The device includes: a sub-band constituting unit (151) which divides a first layer error conversion coefficient to be encoded into M sub-bands so as to generate M sub-band conversion coefficients; a shape vector encoding unit (152) which performs encoding on each of the M sub-band conversion coefficient so as to obtain M shape encoded information and calculates a target gain of each of the M sub-band conversion coefficients; a gain vector forming unit (153) which forms one gain vector by using M target gains; a gain vector encoding unit (154) which encodes the gain vector so as to obtain gain encoded information; and a multiplexing section unit (155) which multiplexes the shape encoded information with the gain encoded information.
Abstract:
Provided is an encoding device which can reduce the encoding distortion as compared to the conventional technique and can obtain a preferable sound quality for auditory sense. In the encoding device, a shape quantization unit (111) quantizes the shape of an input spectrum with a small number of pulse positions and polarities. The shape quantization unit (111) sets a pulse amplitude width to be searched later upon search of the pulse position to a value not greater than the pulse amplitude width which has been searched previously. A gain quantization unit (112) calculates a gain of a pulse searched by the shape quantization unit (111) for each of bands.
Abstract:
A signal decoding apparatus that can suppress any large unusual sounds to provide decoded signals of improved audibility even when the number of hierarchical layers to be used in the decoding process varies due to a packet loss or the like in communication utilizing a scalable encoding/decoding technique. In the signal decoding apparatus, a gain adjusting part (2308) adjusts, based on a control of a decoding control part (2301), the gain of a basic layer decoded signal outputted from a basic layer decoding part (2302). A gain adjusting part (2309) adjusts, based on a control of the decoding control part (2301), the gain of a first expansion layer decoded signal outputted from a first expansion layer decoding part (2303). A gain adjusting part (2310) adjusts, based on a control of the decoding control part (2301), the gain of a second expansion layer decoded signal outputted from a second expansion layer decoding part (2304).
Abstract:
Disclosed are an encoding device and encoding method capable of improving the quality of a decoded signal under very low bit rate conditions using a small amount of computation. A spectrum correction unit (302) performs correction processing on the subspectrum in each subband in such a manner that samples equal to or greater than a subspectrum average value are left unchanged while samples smaller than the subspectrum average value are replaced by zero. As a result of this, it is possible to significantly reduce the number of bits required to quantize the subspectrums without substantial reduction in quality in a local search unit (303) and in a multi-rate indexing unit (304).
Abstract:
Disclosed is a decoding device which can efficiently encode/decode spectral data in a high pass section of a broadband signal. In the disclosed device: a sample group extraction unit (372) partially selects spectral components by means of an ease of selection importance which is the extent that the spectral components come close to the spectral component having the maximum amplitude value, in the spectrum of a high pass estimated by means of first parameters contained in second encoded information and bands most approximated to each of the spectrums of a plurality of sub-bands calculated from the spectrum of a second decode signal; a logarithmic gain application unit (373) applies second parameters to the partially selected spectral components; and an interpolation processing unit (374) applies third parameters which are adaptively set according to the value of the second parameters, to the spectral components which were not partially selected.
Abstract:
Disclosed is a spectral smoothing device with a structure whereby smoothing is performed after a nonlinear conversion has been performed for a spectrum calculated from an audio signal, and with which the amount of processing calculation is significantly reduced while maintaining excellent audio quality. With this spectral smoothing device, a sub band division unit (102) divides an input spectrum into multiple sub bands; a representative value calculation unit (103) calculates a representative value for each sub band using an arithmetic mean and a geometric mean; with respect to each representative value, a nonlinear conversion unit (104) performs a nonlinear conversion the characteristic of which is further emphasized as the value increases; and a smoothing unit (105) that smoothes the representative value which has undergone the nonlinear conversion for each sub band, at the frequency domain.
Abstract:
Provided is a voice encoding device which can accurately encode a spectrum shape of a signal having a strong tonality such as a vowel. The device includes: a sub-band constituting unit (151) which divides a first layer error conversion coefficient to be encoded into M sub-bands so as to generate M sub-band conversion coefficients; a shape vector encoding unit (152) which performs encoding on each of the M sub-band conversion coefficient so as to obtain M shape encoded information and calculates a target gain of each of the M sub-band conversion coefficients; a gain vector forming unit (153) which forms one gain vector by using M target gains; a gain vector encoding unit (154) which encodes the gain vector so as to obtain gain encoded information; and a multiplexing section unit (155) which multiplexes the shape encoded information with the gain encoded information.