Abstract:
Disclosed is an integrated voice/audio encoding/decoding apparatus whereby the overlap region of a window is adjusted based on the transition interval. In the present invention, encoding is carried out by making windows of relatively long length overlap in order to increase the efficiency of encoding, while the overlap region of the windows is reduced in accordance with the transition interval when there is a transition interval between frames, thereby making it possible to reduce the occurrence of noise due to transition intervals.
Abstract:
Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal, may include: a module selection unit (110) to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit (130) to encode the input signal according to a selection of the module selection unit (110) and to generate a speech bitstream; an audio encoding unit (140) to encode the input signal according to the selection of the module selection unit (110) and to generate an audio bitstream; and a bitstream generation unit (150) to generate an output bitstream from the speech encoding unit (130) or the audio encoding unit (140) according to the selection of the module selection unit (110).
Abstract:
Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT- based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encodes the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
Abstract:
Disclosed are an apparatus and method for window processing for connecting between an MDCT frame and a heterogeneous frame, an encoding apparatus and encoding method, and a decoding apparatus and decoding method. The window processing method involves applying, to the MDCT frame, a window for interconnection between the MDCT frame and a frame having no aliasing term, so as to satisfy TDAC conditions for the recovery of an original signal.
Abstract:
The present invention relates to a window processing method and apparatus for interworking between an MDCT-TCX frame and a CELP frame. The window processing apparatus comprises: a coding mode decision unit for deciding a preceding subframe coding mode and a subsequent subframe coding mode, with respect to a current subframe; and a window application unit for applying a window that is determined according to the preceding subframe coding mode and the subsequent subframe coding mode, to the current subframe.
Abstract:
An encoding apparatus and a decoding apparatus in a transform between a Modified Discrete Cosine Transform (MDCT)-based coder and a hetero coder are provided. The encoding apparatus may encode additional information to restore an input signal encoded according to the MDCT-based coding scheme, when switching occurs between the MDCT-based coder and the hetero coder. Accordingly, an unnecessary bitstream may be prevented from being generated, and minimum additional information may be encoded.
Abstract:
A module (100) capable of appropriately selecting a linear predictive coding (LPC)-based (103) or a code excitation linear prediction (CELP)-based speech or audio encoder and a transform-based (104) audio encoder according to a feature of an input signal is a module (100) that performs as a bridge for overcoming a performance barrier between a conventional LPC-based encoder (103) and an audio encoder. Also, an integral audio encoder that provides consistent audio quality regardless of a type of the input audio signal can be designed based on the module.
Abstract:
Provided is a method and apparatus for encoding/decoding a multi-channel audio signal. The apparatus for encoding a multi-channel audio signal includes a frame converter for converting the multi-channel audio signal into a framed audio signal; means for downmixing the framed audio signal; means for encoding the downmixed audio signal; a source location information estimator for estimating source location information from the framed multi-channel audio signal; means for quantizing the estimated source location information; and means for multiplexing the encoded audio signal and the quantized source location information, to generate an encoded multi-channel audio signal.