Abstract:
Provided is an apparatus for integrally encoding and decoding a speech signal and an audio signal. An encoding apparatus for integrally encoding a speech signal and an audio signal may include: a module selection unit (110) to analyze a characteristic of an input signal and to select a first encoding module for encoding a first frame of the input signal; a speech encoding unit (130) to encode the input signal according to a selection of the module selection unit (110) and to generate a speech bitstream; an audio encoding unit (140) to encode the input signal according to the selection of the module selection unit (110) and to generate an audio bitstream; and a bitstream generation unit (150) to generate an output bitstream from the speech encoding unit (130) or the audio encoding unit (140) according to the selection of the module selection unit (110).
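The data flow among the module selection unit, the two encoding units, and the bitstream generation unit can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say which signal characteristic is analyzed, so the zero-crossing-rate feature, the 0.3 threshold, and the placeholder encoders `encode_speech` and `encode_audio` are all hypothetical.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Frame = Sequence[float]


def zero_crossing_rate(frame: Frame) -> float:
    """Fraction of adjacent sample pairs whose signs differ (hypothetical feature)."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return crossings / (len(frame) - 1)


def select_module(frame: Frame, threshold: float = 0.3) -> str:
    """Module selection: pick an encoding module per frame (threshold is an assumption)."""
    return "audio" if zero_crossing_rate(frame) > threshold else "speech"


def encode_speech(frame: Frame) -> bytes:
    # Placeholder for a real speech encoder: emits a tag and the frame length.
    return b"S" + len(frame).to_bytes(2, "big")


def encode_audio(frame: Frame) -> bytes:
    # Placeholder for a real audio encoder: emits a tag and the frame length.
    return b"A" + len(frame).to_bytes(2, "big")


ENCODERS: Dict[str, Callable[[Frame], bytes]] = {
    "speech": encode_speech,
    "audio": encode_audio,
}


def generate_bitstream(frames: List[Frame]) -> List[Tuple[str, bytes]]:
    """Bitstream generation: take each frame's output from the selected encoder."""
    output = []
    for frame in frames:
        module = select_module(frame)
        output.append((module, ENCODERS[module](frame)))
    return output
```

Each frame is routed to exactly one encoder, and the output records which module was chosen so that a decoder could dispatch the same way.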
Abstract:
Disclosed is an LPC residual signal encoding/decoding apparatus of an MDCT-based unified voice and audio encoding device. The LPC residual signal encoding apparatus analyzes a property of an input signal, selects an encoding method of an LPC filtered signal, and encodes the LPC residual signal based on one of a real filterbank, a complex filterbank, and an algebraic code excited linear prediction (ACELP).
Abstract:
An encoding apparatus and a decoding apparatus for switching between a Modified Discrete Cosine Transform (MDCT)-based coder and a hetero coder are provided. When switching occurs between the MDCT-based coder and the hetero coder, the encoding apparatus may encode additional information for restoring an input signal encoded according to the MDCT-based coding scheme. Accordingly, generation of an unnecessary bitstream may be prevented, and only a minimum of additional information is encoded.
Abstract:
A module (100) appropriately selects between a linear predictive coding (LPC)-based or code-excited linear prediction (CELP)-based speech or audio encoder (103) and a transform-based audio encoder (104) according to a feature of an input signal. The module (100) acts as a bridge for overcoming the performance barrier between a conventional LPC-based encoder (103) and an audio encoder. Also, an integral audio encoder that provides consistent audio quality regardless of the type of the input audio signal can be designed based on the module.
Abstract:
A Unified Speech and Audio Codec (USAC) that may process a window sequence based on mode switching is provided. When mode switching occurs, the USAC may perform encoding or decoding by overlapping frames based on a folding point. The USAC may process a different window sequence for each situation to perform encoding or decoding, and may thereby improve coding efficiency.
Abstract:
Disclosed are an apparatus and method for window processing for connecting between an MDCT frame and a heterogeneous frame, an encoding apparatus and encoding method, and a decoding apparatus and decoding method. The window processing method involves applying, to the MDCT frame, a window for interconnection between the MDCT frame and a frame having no aliasing term, so as to satisfy TDAC conditions for the recovery of an original signal.
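As background for the TDAC conditions mentioned above, the sketch below checks the Princen-Bradley power-complementarity condition, w[n]^2 + w[n + N]^2 = 1, which a symmetric window of length 2N must satisfy for time-domain aliasing to cancel between overlapping MDCT frames. It illustrates the ordinary MDCT-to-MDCT case only; the specific transition window described in the disclosure is not given in the abstract, and the function names here are hypothetical.

```python
import math
from typing import List, Sequence


def sine_window(length: int) -> List[float]:
    """Sine window of the given (even) length, a common MDCT window."""
    return [math.sin(math.pi * (n + 0.5) / length) for n in range(length)]


def satisfies_princen_bradley(window: Sequence[float], tol: float = 1e-12) -> bool:
    """Check w[n]^2 + w[n + N]^2 == 1 for every n in the first half (N = len / 2)."""
    if len(window) % 2 != 0:
        raise ValueError("window length must be even")
    half = len(window) // 2
    return all(
        abs(window[n] ** 2 + window[n + half] ** 2 - 1.0) <= tol
        for n in range(half)
    )
```

For example, `satisfies_princen_bradley(sine_window(16))` is true, while a rectangular window of ones fails the check because the squared halves sum to 2.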
Abstract:
The present invention relates to a window processing method and apparatus for interworking between an MDCT-TCX frame and a CELP frame. The window processing apparatus comprises: a coding mode decision unit for deciding a preceding subframe coding mode and a subsequent subframe coding mode, with respect to a current subframe; and a window application unit for applying a window that is determined according to the preceding subframe coding mode and the subsequent subframe coding mode, to the current subframe.
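The decision logic described above, in which the window for the current subframe depends on the coding modes of its neighbors, can be sketched as a lookup table. This is a minimal illustration: the window labels are hypothetical placeholders, since the abstract does not specify the actual window shapes.

```python
from enum import Enum
from typing import Dict, Tuple


class Mode(Enum):
    CELP = "celp"
    MDCT_TCX = "mdct_tcx"


# Hypothetical window labels keyed by (preceding mode, subsequent mode).
# The real window shapes are not stated in the abstract.
WINDOW_TABLE: Dict[Tuple[Mode, Mode], str] = {
    (Mode.MDCT_TCX, Mode.MDCT_TCX): "overlap_left_overlap_right",
    (Mode.CELP, Mode.MDCT_TCX): "no_overlap_left_overlap_right",
    (Mode.MDCT_TCX, Mode.CELP): "overlap_left_no_overlap_right",
    (Mode.CELP, Mode.CELP): "no_overlap_left_no_overlap_right",
}


def choose_window(preceding: Mode, subsequent: Mode) -> str:
    """Coding mode decision followed by window selection for the current subframe."""
    return WINDOW_TABLE[(preceding, subsequent)]
```

A table keeps the four neighbor combinations explicit, so a window application step only needs the returned label to pick the matching window.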
Abstract:
A method for visualizing multi-channel audio signals, a method for converting audio information by using a spatial cue, and an apparatus thereof are provided. They visualize the reproduction process of the multi-channel signals and apply variation of the spatial cue to the generation of the multi-channel signals, thereby reproducing realistic multi-channel audio signals and adjusting a position of a visual audio source of an audio signal. An apparatus for visualizing multi-channel audio signals and converting audio information by using a spatial cue comprises the following parts: an SAC (Spatial Audio Coding) part consisting of an SI decoder (102) for decoding the spatial cue and an SAC decoder (101) for synthesizing multi-channel signals; a spatializer (103) and a visualizer (104) for receiving the decoded spatial cue information; and a displayer (105) for receiving a panning angle and power gain information of each channel.
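One way a panning angle and per-channel power gains could be derived from a spatial cue is sketched below. It rests on assumptions not stated in the abstract: the cue is taken to be a channel level difference (CLD) in decibels between two channels, the gains are normalized so their squares sum to one, and the angle follows a constant-power panning law. The function names are hypothetical.

```python
import math
from typing import Tuple


def gains_from_cld(cld_db: float) -> Tuple[float, float]:
    """Convert a level difference in dB (channel 1 over channel 2) to amplitude gains.

    Assumes gains normalized so that g1**2 + g2**2 == 1.
    """
    ratio = 10.0 ** (cld_db / 10.0)  # power ratio P1 / P2
    g1 = math.sqrt(ratio / (1.0 + ratio))
    g2 = math.sqrt(1.0 / (1.0 + ratio))
    return g1, g2


def panning_angle_deg(g1: float, g2: float) -> float:
    """Angle under a constant-power law: 0 deg is fully channel 1, 90 deg fully channel 2."""
    return math.degrees(math.atan2(g2, g1))
```

With a CLD of 0 dB, both gains equal the square root of one half and the angle sits midway between the two channels; a larger positive CLD moves the angle toward channel 1.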