Abstract:
A process for encoding digital video signals (IS) organized in frames (12) comprises the operations of dividing said frames into blocks starting from macroblocks subjected to motion compensation (13, 14) and applying to said blocks a discrete cosine transform (15) in such a way as to generate respective sets of coefficients. Said sets of coefficients are then assembled (20) by being organized into sets of vectors (X) by means of masking (M1, M2, M3, M4). Once the variance of the vectors has been detected (214), the vectors themselves are quantized (217 to 220) on a number of available bits by means of a pyramid vector quantizer (22), associating with the vectors respective quantization pyramids having given sizes according to the variance detected and to the number of available bits. Finally, the vectors are encoded with respective codewords.
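As an illustration of how a pyramid size can follow from a bit budget, the sketch below counts the integer points on a pyramid (vectors of dimension L whose absolute values sum to K) and picks the largest K whose points can still be indexed with the available bits. It is a minimal sketch, not the process claimed above: the function names `pyramid_points` and `largest_pyramid` are hypothetical, the counting recurrence is the standard one for pyramid vector quantization, and the way the detected variance enters the choice is not modelled because the abstract does not specify it.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def pyramid_points(dim, k):
    """Number of integer vectors of length `dim` whose absolute values sum to k."""
    if k == 0:
        return 1          # only the all-zero vector
    if dim == 0:
        return 0          # no coordinates left but k > 0
    # Standard counting recurrence for pyramid vector quantization.
    return (pyramid_points(dim - 1, k)
            + pyramid_points(dim - 1, k - 1)
            + pyramid_points(dim, k - 1))


def largest_pyramid(dim, bits):
    """Largest k (hypothetical selection rule) with pyramid_points(dim, k) <= 2**bits."""
    if dim < 1:
        raise ValueError("dim must be at least 1")
    k = 0
    while pyramid_points(dim, k + 1) <= 2 ** bits:
        k += 1
    return k
```

For example, `largest_pyramid(3, 5)` selects the largest pyramid over three-dimensional vectors whose points fit in a 5-bit codeword.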
Abstract:
The compression and coding of digital data pertaining to video sequences of pictures, including motion estimation for removing temporal redundancy, greatly benefit from an ability to recognize the occurrence of a change of scene, so that the prediction computation of the pictures can be commanded to use only forward motion estimation for pictures preceding the change of scene and only backward motion estimation for pictures subsequent to the change of scene in a given sequence. According to the method of the invention, a change of scene is reliably detected by checking two distinct indexes. Spurious detections in the event of noisy pictures, zooming, and other situations that may affect one index or the other are prevented.
Abstract:
Digital video signals, such as RGB signals, are subjected to processing by ordering the pixels into groups (M) of adjacent pixels (A, B, C, D) and carrying out the operations of:
- i) truncating the respective string of bits, eliminating a given number (n) of least-significant bits, thus generating a respective truncated string;
- ii) adding to the truncated string an integration factor constituted by a power of two of a given order, thus generating a respective integrated string; and
- iii) calculating first and second values of pixel error corresponding to the difference between the respective string and the truncated string and between the respective string and the integrated string, respectively.
A set of values of global error is then calculated, comprising all the possible combinations of addition of the first and second values of pixel error over the group (M), and the value of minimum error in that set is located. In the combination that produces the value of minimum global error, a first number of pixels is identified whose contribution to the minimum global error is the first value of pixel error, together with a second number of pixels whose contribution is the second value of pixel error. The compressed version of the video signals represented by the pixels in the group then uses a first number of truncated strings and a second number of integrated strings, equal, respectively, to the first number of pixels and to the second number of pixels, with the integrated strings subjected to truncation.
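The search over combinations can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function name `compress_group` is hypothetical, samples are assumed to be 8-bit, the integration factor is assumed to be 2**n, the minimum is taken on the absolute value of the global error, the integrated code is clamped so it still fits in the shortened word, and ties are resolved by taking the first combination found.

```python
from itertools import product


def compress_group(pixels, n, bits=8):
    """Return (codes, global_error) for one group of adjacent pixel values."""
    max_code = (1 << (bits - n)) - 1      # largest code after dropping n bits
    candidates = []
    for value in pixels:
        trunc_code = value >> n           # step i: drop n least-significant bits
        truncated = trunc_code << n
        # Step ii: add 2**n (assumed integration factor), then truncate again.
        integ_code = min(trunc_code + 1, max_code)   # clamp is an assumption
        integrated = integ_code << n
        # Step iii: the two per-pixel errors, each paired with its code.
        candidates.append(((trunc_code, value - truncated),
                           (integ_code, value - integrated)))

    def global_error(choice):
        return sum(candidates[i][c][1] for i, c in enumerate(choice))

    # Enumerate every truncated/integrated combination over the group.
    best = min(product((0, 1), repeat=len(pixels)),
               key=lambda choice: abs(global_error(choice)))
    codes = [candidates[i][c][0] for i, c in enumerate(best)]
    return codes, global_error(best)
```

Without clamping, switching any one pixel from its truncated to its integrated string changes the global error by the same amount, 2**n, so the minimum depends only on how many pixels use each kind of string. That is consistent with the text above, which fixes the two counts rather than the individual pixels.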
Abstract:
A process and a system for generating, starting from an MPEG input bitstream (IS), an MPEG output bitstream (OS), the output bitstream (OS) having a resolution (Hor/N x Vert/M) modified with respect to the resolution (Hor x Vert) of the input bitstream (IS). The process comprises the operations of:
- distinguishing (100), in the input bitstream (IS), first portions and second portions which respectively substantially do not affect and do affect the variation of resolution; and
- subjecting (114 to 122) said second portions of the input bitstream (IS) to a function of modification of the resolution obtained by filtering in the domain of the discrete cosine transform (DCT), and then transferring (134), to said output bitstream (OS), said second portions subjected to filtering in the domain of the discrete cosine transform.
The invention also relates to the corresponding computer program product.
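One common way to change resolution directly on DCT coefficients is to keep only the low-frequency half of each transformed row and rescale it, so that a shorter inverse transform yields the reduced-size signal. The sketch below shows that idea in one dimension. It is an assumption chosen for illustration, not necessarily the filtering used in the process above; the function names are hypothetical and the transforms are plain orthonormal DCT-II and its inverse written out in pure Python.

```python
import math


def dct(samples):
    """Orthonormal DCT-II of a list of samples."""
    n = len(samples)
    coeffs = []
    for k in range(n):
        scale = math.sqrt((1 if k == 0 else 2) / n)
        coeffs.append(scale * sum(
            samples[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
            for i in range(n)))
    return coeffs


def idct(coeffs):
    """Inverse of the orthonormal DCT-II above."""
    n = len(coeffs)
    return [sum(math.sqrt((1 if k == 0 else 2) / n) * coeffs[k]
                * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                for k in range(n))
            for i in range(n)]


def halve_in_dct_domain(coeffs):
    """Keep the low-frequency half and rescale for the shorter inverse transform."""
    half = len(coeffs) // 2
    return [c / math.sqrt(2) for c in coeffs[:half]]
```

Applying `idct(halve_in_dct_domain(dct(row)))` to an 8-sample row gives a 4-sample row; a constant row keeps its value, which is a quick sanity check on the scaling.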
Abstract:
A method of producing an output bitstream of coded digital video data, with a bit-rate different from the bit-rate of an input bitstream, includes the steps of:
- dividing said input bitstream into a sequence of coded data and a sequence of control bits;
- modifying said sequence of control bits as a function of the desired bit-rate of the output bitstream, producing an output sequence of control bits;
- decoding said sequence of coded data, producing an intermediate sequence of data;
- quantizing with a pre-established step and coding said intermediate sequence of data, producing an output sequence of coded data; and
- merging said output sequences, producing said output bitstream with the desired bit-rate.
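The steps above can be sketched on a toy model of a bitstream. Everything in this example is a simplifying assumption made for illustration: the stream is a dictionary with hypothetical `control` and `levels` fields, decoding is reduced to inverse quantization, entropy coding is omitted, and the names `requantize` and `transcode` are not taken from the source.

```python
def requantize(levels, step_in, step_out):
    """Inverse-quantize with step_in, then quantize again with step_out."""
    out = []
    for level in levels:
        coeff = level * step_in                        # decode to intermediate data
        # Round the magnitude to the nearest multiple of step_out (integer math).
        magnitude = (2 * abs(coeff) + step_out) // (2 * step_out)
        out.append(magnitude if coeff >= 0 else -magnitude)
    return out


def transcode(stream, target_bit_rate, step_out):
    """Split, modify the control data, requantize the coded data, and merge."""
    control = dict(stream["control"])                  # split off the control part
    step_in = control["quant_step"]
    control["bit_rate"] = target_bit_rate              # modified control sequence
    control["quant_step"] = step_out
    levels = requantize(stream["levels"], step_in, step_out)
    return {"control": control, "levels": levels}      # merged output stream
```

A coarser `step_out` yields smaller quantized levels, which is what lowers the bit-rate once the levels are entropy coded.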
Abstract:
A method of coding video data operates according to a standard wherein several types of fields (P, B) besides reference or Intra (I) fields are identified and various coding options exist for said non-Intra fields (P, B). The method includes removing temporal redundancy from non-Intra fields (P, B) by way of an algorithm of motion estimation and identification of predictor macroblocks providing the best approximation according to a prediction mode selectable among the respective prediction modes of the different types of non-Intra fields (P, B). It further includes calculating the discrete cosine transform (DCT) for blocks of data of said macroblocks either according to a frame mode of decomposition, wherein the blocks are composed of lines of data belonging to an even-lines semifield and to an odd-lines semifield, or according to a field mode of decomposition, wherein the blocks are composed of lines of data belonging to the same semifield, and quantizing and coding the data resulting from said discrete cosine transform (DCT) carried out by blocks of data to be stored. The efficiency of the method is increased by choosing between field and frame modes, and between motion compensation and no motion compensation, with an algorithm that assesses and compares significant complexity indexes of macroblocks.
Abstract:
By relying on a temporal correlation among successive pictures as well as on a spatial correlation of motion vectors of macroblocks of the currently processed picture, and by using a hierarchical recursive motion estimation algorithm, the hardware complexity of video coders complying with the MPEG-2 standard can be greatly reduced without an appreciable loss of quality of the video images being transferred. The method of motion estimation and a hardware embodiment of a coder are described, and performance is compared with a prior motion estimation system.