Abstract:
Disclosed is a stereo acoustic signal encoding apparatus in which the signal quality does not deteriorate if there are a plurality of sound sources. A peak tracing unit (401) splits frames of a right channel signal and a left channel signal into a plurality of sub frames; detects the peaks of wave shapes of the split sub frames; and estimates a frame delay time D for each frame of the right channel signal and the left channel signal by comparing the positions of the detected peaks. A time adjusting unit (402) adjusts the time of the right channel signal on the basis of the frame time delay D. A down-mix operation is carried out using the right channel signal which has been subjected to the time adjustment and the left channel signal to generate a mono signal and a sub signal. A mono signal encoding unit (403) encodes the mono signal. A sub signal encoding unit (404) encodes the sub signal. The time delay encoding unit (405) encodes the frame time delay D.
Abstract:
The purpose of the present invention is to more efficiently extend, using a low bit rate, the bandwidth of input signals having a harmonics structure, in order to obtain better audio quality. The present invention is installed in a device that extends bandwidth for audio signal encoding and decoding. This novel bandwidth extension encoding identifies a low-frequency spectrum component having the highest correlation to a high-frequency bandwidth signal among input signals, duplicates a high-frequency spectrum by energy adjustment of said component, and maintains the harmonic relationship between the low-frequency spectrum and the duplicated high-frequency spectrum by adjusting the spectral peak position of the duplicated high-frequency spectrum, on the basis of a harmonic frequency estimated from a composite low-frequency spectrum.
Abstract:
An audio encoding apparatus that allows a decoded signal exhibiting an excellent sound quality to be obtained on a decoding side. In the audio encoding apparatus (1000A), a time-frequency transform unit (1001) uses a time-frequency transform, such as a discrete Fourier transform (DFT) or a modified discrete cosine transform (MDCT), to transform a time domain signal (S(n)) to a frequency domain signal (spectrum factor) (S(f)). A psychoacoustic model analyzing unit (1002) performs a psychoacoustic model analysis of the frequency domain signal (S(f)), thereby obtaining a masking curve. An acoustic sense weighting unit (1003) estimates, based on the masking curve, an importance degree of acoustic sense, and determines and applies the weighting factors of respective spectrum factors to the respective spectrum factors. An encoding unit (1004) encodes the frequency domain signal (S(f)) as weighted in terms of the acoustic sense. A multiplexing unit (1005) multiplexes and transmits the encoded parameters.