Abstract:
A multi-dimensional transform and quantization method and apparatus that use fundamental blocks to reduce complexity are provided to improve video data compression by collecting transform coefficients and transforming them again. A transform unit (210) performs a first transform on input video data, extracts the transform coefficients of the same frequency from the result, and performs a second transform on the extracted coefficients. A quantizing unit (230) quantizes the coefficients produced by the second transform to fixed-point numbers. A scale unit (220) produces a first scale factor for the video data in the first transform and a second scale factor for the video data in the second transform.
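The two-stage flow in the abstract above can be sketched as follows. This is a minimal illustration only, under assumptions not stated in the abstract: the blocks are 1-D, both stages use a floating-point orthonormal DCT-II, and the scale factors and fixed-point arithmetic are left out. The names `dct_1d`, `two_stage_transform`, and `quantize` are hypothetical.

```python
import math

def dct_1d(x):
    """Orthonormal DCT-II of a 1-D sequence (assumed transform for both stages)."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out

def two_stage_transform(blocks):
    """blocks: list of equal-length 1-D sample blocks."""
    # First transform: one DCT per block.
    first = [dct_1d(b) for b in blocks]
    # Collect the coefficients that share a frequency index across all blocks,
    # then apply the second transform to each collected group.
    n_freq = len(first[0])
    return [dct_1d([blk[k] for blk in first]) for k in range(n_freq)]

def quantize(coeffs, step):
    """Uniform quantization of the second-stage coefficients to integers."""
    return [[int(round(c / step)) for c in row] for row in coeffs]

blocks = [[1.0, 1.0, 1.0, 1.0] for _ in range(4)]
levels = quantize(two_stage_transform(blocks), 1.0)
```

For constant input, all of the energy ends up in the single coefficient `levels[0][0]`, which is the compaction the second transform is meant to exploit.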
Abstract:
An encoding device that encodes an intra image based on reference-block waveforms, and a device that efficiently decodes the image, are provided to improve image quality and reduce the encoded size of the image by encoding an input image efficiently. An input image dividing unit (210) divides the input image into a plurality of image blocks including a first image block and a second image block. A waveform information generating unit (220) selects a plurality of reference pixels from among the pixels included in the first image block and generates first waveform information about the first image block based on the differences in pixel value between the reference pixels. Second waveform information about the second image block is generated based on the differences in pixel value within the second image block.
Abstract:
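A lossless encoding/decoding apparatus for audio signals and a method thereof are disclosed. First bitstreams for the respective levels are generated from the quantization indexes of the frequency coefficients of the current frame; symbols are generated from the run lengths of a second bitstream in which the first bitstreams for the levels are arranged in a single row; and the symbols are then encoded into a third bitstream. This improves the coding performance for audio signals.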
Abstract:
A quantizer and a quantization method for LSF (line spectral frequency) coefficients in a wide-band speech coder using a trellis coded quantization algorithm are provided to improve spectral distortion (SD) performance and the number of assigned bits by reducing error propagation through the parallel use of a predictive structure and a non-predictive structure. The quantizer includes a predictive structure quantizing portion (200), a non-predictive structure quantizing portion (210), and a switching portion (220). The predictive structure quantizing portion calculates a quantized candidate vector by quantizing an LSF coefficient vector, and calculates a predictive quantization final vector by trellis coded quantization of the candidate vector with reference to a predicted LSF vector of the LSF coefficient vector. The non-predictive structure quantizing portion calculates a quantized candidate vector by quantizing the LSF coefficient vector, and calculates a non-predictive quantization final vector by trellis coded quantization of the candidate vector. The switching portion compares the LSF coefficient vector with the predictive and non-predictive quantization final vectors and selects the one with the smaller difference as the final quantization vector of the LSF coefficient vector.
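The switching step described above can be illustrated with a short sketch. It is not the patented quantizer: a plain uniform scalar quantizer stands in for trellis coded quantization, squared error stands in for the comparison metric, and the function names are hypothetical.

```python
def uniform_quantize(vec, step):
    """Stand-in for trellis coded quantization: a uniform scalar quantizer."""
    return [step * round(v / step) for v in vec]

def sq_error(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def switched_quantize(lsf, predicted, step):
    """Run both structures in parallel and keep the closer reconstruction."""
    # Predictive path: quantize the prediction residual, then add the prediction back.
    residual = [x - p for x, p in zip(lsf, predicted)]
    pred_final = [p + q for p, q in zip(predicted, uniform_quantize(residual, step))]
    # Non-predictive path: quantize the LSF vector directly.
    nonpred_final = uniform_quantize(lsf, step)
    # Switching portion: pick the final vector with the smaller difference.
    if sq_error(lsf, pred_final) <= sq_error(lsf, nonpred_final):
        return "predictive", pred_final
    return "non-predictive", nonpred_final

mode, final = switched_quantize([0.30, 0.62], [0.30, 0.62], 0.25)
```

When the prediction is accurate the predictive path wins; when it is poor, for example after a transmission error has corrupted the predictor state, the non-predictive path can take over, which is how the parallel structure limits error propagation.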
Abstract:
A lossless encoding/decoding apparatus and a method thereof are provided to compress audio signals into bitstreams with fewer bits by improving the lossless coding of frequency coefficients. A lossless encoding apparatus includes a bit converter (422), a run length converter (424), and a run length encoder (430). The bit converter generates first bitstreams for the respective levels from the quantization indexes of the frequency coefficients of a current frame. The run length converter generates symbols formed from the run lengths of a second bitstream in which the first bitstreams are arranged in a single row. The run length encoder encodes the symbols into a third bitstream.
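A rough sketch of the first two stages follows. It rests on two assumptions that the abstract does not confirm: a "level" is treated as a bit plane of non-negative quantization indexes, and a symbol is a (bit value, run length) pair. The final entropy coding of the symbols into the third bitstream is omitted, and all function names are hypothetical.

```python
def bit_planes(indexes, num_levels):
    """First bitstreams: one per level (assumed to be a bit plane), MSB first."""
    return [[(q >> lvl) & 1 for q in indexes] for lvl in reversed(range(num_levels))]

def run_lengths(bits):
    """Symbols: (bit value, run length) pairs over a bit sequence."""
    runs = []
    for b in bits:
        if runs and runs[-1][0] == b:
            runs[-1][1] += 1
        else:
            runs.append([b, 1])
    return [tuple(r) for r in runs]

def encode(indexes, num_levels):
    """Concatenate the per-level bitstreams into one row, then run-length it."""
    second = [b for plane in bit_planes(indexes, num_levels) for b in plane]
    return run_lengths(second)

def decode(symbols, count, num_levels):
    """Inverse of encode: expand the runs, split into planes, rebuild indexes."""
    bits = [b for b, n in symbols for _ in range(n)]
    out = [0] * count
    for i in range(num_levels):
        plane = bits[i * count:(i + 1) * count]
        out = [(v << 1) | b for v, b in zip(out, plane)]
    return out

indexes = [0, 1, 0, 0, 3, 0, 0, 0]
symbols = encode(indexes, 2)
restored = decode(symbols, len(indexes), 2)
```

Sparse quantization indexes produce long zero runs in the upper planes, so the symbol list is short relative to the raw bit count.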
Abstract:
A system and a method for an exhibition guide service using panoramic images are provided to offer detailed information corresponding to the present location of a client by determining that location through location tracing sensors. A location tracing sensor (2) receives information from a client terminal device (1) and transmits the client terminal device information together with its own information. A location tracer (3) traces the location of the client terminal device and provides the present location information to the client terminal device. The client terminal device provides its information to the location tracing sensor and receives the present location information from the location tracer so as to present the exhibition information corresponding to the present location to the client.
Abstract:
A system and a method for transmitting and receiving multi-view panoramic content with improved transmission efficiency are provided, in which the multi-view content obtained from cameras is defined as individual objects and only the content requested by a user is delivered in real time. An encoding member (311) encodes the multi-view panoramic content. A scene producing member (312) defines each piece of encoded content as an object descriptor, produces a scene binary format stream and an IOD (Initial Object Descriptor), and composes a scene from the image streams adjacent to the viewpoint requested by the user. A packetizer packetizes the image stream, the object descriptor stream, and the scene binary format stream from the scene producing member. A multiplexing member (314) multiplexes the packetized image stream, object descriptor stream, and scene binary format stream with the IOD produced by the scene producing member and transmits the multiplexed data.
Abstract:
An apparatus and a method for processing multi-channel audio signals are provided to give the user active control by controlling a multi-channel audio signal channel by channel, using the threshold at which a human perceives displacement of a sound source. A transmitting apparatus (100) comprises a multi-channel audio encoder (102), a scene expression language encoder (104), and a multiplexer (106). The multi-channel audio encoder generates a multi-channel audio stream by encoding the multi-channel audio signal. The scene expression language encoder generates a scene expression language stream by encoding multi-channel audio control information. The multiplexer multiplexes the multi-channel audio stream and the scene expression language stream. The multi-channel audio control information includes data for controlling each channel signal individually. The data includes at least one of the number of channels, the horizontal position of each channel signal, the vertical position of each channel signal, the horizontal transition speed of each channel signal, and the vertical transition speed of each channel signal. The horizontal and vertical transition speeds are expressed in basic units of the threshold at which a human can perceive the position of the sound source.
Abstract:
An apparatus and a method for encoding and decoding that selectively use a converter according to the correlation of residual coefficients are provided. After inter-image or intra-image prediction on a block of predetermined size, quantized transform coefficients are generated through converters and quantizers, and RD (Rate-Distortion) cost optimization over the DCT (Discrete Cosine Transform) and the DST (Discrete Sine Transform) selects the converter with the highest compression rate, thereby improving the compression rate of the image block. A first converter (31-34) performs, in block units, the DCT, first quantization, first inverse quantization, and the IDCT (integer-approximated inverse discrete cosine transform) on the residual coefficients generated by inter-image or intra-image prediction. A second converter (35-38) performs, in block units, the DST, second quantization, second inverse quantization, and the IDST (integer-approximated inverse discrete sine transform) on the residual coefficients. A selection unit (39) evaluates the RD cost to select, block by block, the converter with the higher compression rate, and a display unit (40) marks the converter chosen by the selection unit in a corresponding flag bit for each macroblock.
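The per-block selection can be sketched as below, under several simplifying assumptions that are not taken from the abstract: blocks are 1-D, the transforms are a floating-point orthonormal DCT-II and DST-I rather than integer approximations, the rate is approximated by the count of nonzero quantized levels, and the cost is J = D + λ·R. Because both transforms are orthonormal, the distortion can be measured in the coefficient domain without running the inverse transform. All names are hypothetical.

```python
import math

def dct(x):
    """Orthonormal DCT-II."""
    n = len(x)
    return [(math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n))
            * sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
            for k in range(n)]

def dst(x):
    """Orthonormal DST-I (one possible sine transform; an assumption here)."""
    n = len(x)
    return [math.sqrt(2.0 / (n + 1))
            * sum(x[i] * math.sin(math.pi * (i + 1) * (k + 1) / (n + 1)) for i in range(n))
            for k in range(n)]

def rd_cost(coeffs, step, lam):
    """Quantize, then return (J, levels) with J = distortion + lam * rate."""
    levels = [round(c / step) for c in coeffs]
    distortion = sum((c - l * step) ** 2 for c, l in zip(coeffs, levels))
    rate = sum(1 for l in levels if l != 0)  # crude proxy for the bit count
    return distortion + lam * rate, levels

def choose_transform(residual, step, lam):
    """Return the name of the cheaper transform and its quantized levels."""
    costs = {"DCT": rd_cost(dct(residual), step, lam),
             "DST": rd_cost(dst(residual), step, lam)}
    best = min(costs, key=lambda name: costs[name][0])
    return best, costs[best][1]

flat_choice, _ = choose_transform([4.0, 4.0, 4.0, 4.0], 1.0, 1.0)
arch = [10 * math.sin(math.pi * (i + 1) / 5) for i in range(4)]
arch_choice, _ = choose_transform(arch, 1.0, 1.0)
```

A flat residual compacts into one DCT coefficient, while a residual shaped like a half sine arch compacts into one DST coefficient. The returned name plays the role of the per-block flag bit that tells the decoder which inverse transform to apply.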
Abstract:
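The present invention relates to a quantization-based audio watermarking apparatus and method that are robust to amplitude scaling. The encoder of the invention comprises: an analysis filter bank that splits an audio input signal into predetermined subbands; a psychoacoustic module that receives the audio input signal and applies a psychoacoustic model to provide a mask ratio; a watermark encoder that computes encoding coefficients for the split subbands according to the mask ratio from the psychoacoustic module, embeds a watermark in the mid-frequency band, and provides side information; and a synthesis filter bank that recombines the subband signals and outputs an audio signal containing the watermark. The decoder of the invention comprises: an analysis filter bank that splits a received signal into a predetermined number of subbands; an EM estimator that estimates the amplitude scaling ratio from the received encoding coefficients and the watermarked subbands according to the EM algorithm and provides a decoder quantization step size (Δd) adjusted for the scaling; a watermark decoder that extracts the watermark from the mid-frequency subbands, taking the decoder quantization step size from the EM estimator into account; and a combined decision unit that combines the outputs of the watermark decoder to determine the watermark.
Keywords: watermarking, quantization, amplitude scaling, EM algorithm, subband, estimation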
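To show why the decoder needs a rescaled step size, here is a minimal sketch using generic quantization index modulation (QIM) on individual coefficients. It is an assumption that the patent's embedding resembles this scheme. The filter banks, psychoacoustic model, and EM estimator are not reproduced: the gain is simply taken as known, whereas in the described decoder it would come from the EM estimator. The function names are hypothetical.

```python
def qim_embed(x, bit, delta):
    """Move x onto the lattice for `bit`: multiples of delta, shifted by delta/2 for bit 1."""
    offset = bit * delta / 2.0
    return delta * round((x - offset) / delta) + offset

def qim_extract(y, delta_d):
    """Decide which of the two lattices (step delta_d) lies closer to y."""
    d0 = abs(y - qim_embed(y, 0, delta_d))
    d1 = abs(y - qim_embed(y, 1, delta_d))
    return 0 if d0 <= d1 else 1

delta = 0.5                      # encoder quantization step size
bits = [1, 0, 1, 1]
coeffs = [0.37, -1.20, 2.05, 0.90]
marked = [qim_embed(c, b, delta) for c, b in zip(coeffs, bits)]

gain = 0.7                       # amplitude scaling in the channel (assumed known here)
received = [gain * m for m in marked]
delta_d = gain * delta           # decoder step size adjusted for the scaling
recovered = [qim_extract(r, delta_d) for r in received]
```

Scaling the signal also scales the embedding lattice, so extraction with the original `delta` would compare against the wrong lattice, while extraction with `delta_d` lines up with it again.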