Abstract:
A scalable frame compatible three-dimensional video encoding and decoding system for use in a multiview video coding system is described. A base layer includes low resolution information from a plurality of views while one or more enhancement layers may include high resolution information for at least one of the plurality of views. Interpolation filters are derived based on a combination of low resolution information and high resolution information are discussed. For a given view, sending high resolution information at some times and low resolution information at other times are also described.
Abstract:
Deriving illumination compensation parameters and detection of illumination dominant transitions types for video coding and processing applications is described. Illumination changes such as fade-ins, fade-outs, cross-fades, and flashes are detected. Detection of these illumination changes is then used for weighted prediction to provide for improved illumination compensation.
Abstract:
Methods and systems for multi-layer video encoding are described. In the system, an inter-layer analysis module gathers and analyzes inter-layer coding statistics, so that a rate control module can perform rate control on the various layers based on the statistics.
Abstract:
Methods and systems for coding applications are described, where multiple coding passes are performed for purposes of rate control. Prediction structures are generated and used by the multiple coding passes to update coding statistics generated prior to the coding passes. The updated coding statistics are used for a subsequent coding operation.
Abstract:
Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame reformats the images for a display device. A 3D video may be encoded using a most advantageous interleaving format such that a preferred quality and compression ratio is reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.
Abstract:
Coding method complexity types are assigned to video for combinations of coding decision levels associated with a sequence of pictures, a group of pictures, pictures, a scene, a region of the scene or picture, a block or macroblock. Multiple coding units are encoded as a function of complexity and determined from the coding decision levels. A first coding decision level is associated with determining a first complexity and a first coding method type for the sequence of pictures. A second coding decision level is associated with determining a second complexity and a second coding method type for the pictures or scene. The picture or sequence includes the coding units. A third coding decision level is associated with determining a third complexity and a third coding method type for the coding units.
Abstract:
Controlling a feature of an encoding process for regions of an image pattern representing more than one image when the regions include an amount of disparity in the represented images that would result in cross-contamination between the represented images if encoded with the feature. The control may be, for example, any of, turning the encoding feature off, using the encoding feature less often than when encoding an image pattern representing a single image, negatively biasing the encoding feature, and enabling the encoding feature for regions determined to have zero or near zero disparity and disabling the feature for all other regions. The represented images comprise, for example, any of a stereoscopic view, multiple stereoscopic views, multiple views of a same scene, and multiple unrelated views.
Abstract:
Techniques and systems are disclosed that relate to overlapped block disparity estimation and compensation. Some methods for compensation of images with overlapped block disparity compensation (OBDC) involve determining if OBDC is enabled in a video bit stream, and determining if OBDC is enabled for one or more macroblocks that neighbor a first macroblock within the video bit stream. The one or more neighboring macroblocks can be transform coded. If OBDC is enabled in the video bit stream and for the one or more neighboring macroblocks, the methods involve performing prediction for a region of the first macroblock that has an edge adjacent with the one or more neighboring macroblocks. OBDC can be causally applied. The methods can involve sharing or copying one or more disparity compensation parameters or modes amongst one or more views or layers. Various types of prediction can be used with causally-applied OBDC features and techniques.
Abstract:
Adaptive motion information cost estimation is achieved in processing video information. A transmission cost is estimated that is associated with encoding a motion vector difference (mvd) in motion vectors that describe a motion characteristic of the video information. The mvd is encoded based on minimizing a rate estimation mismatch associated with the motion vectors. The encoding step includes computing a bit count associated with the mvd using CABAC. A value is indexed that corresponds to the cost from one or more entries in the lookup table. The cost relates to context or content characteristics associated with the video information. The lookup table is adaptively updated based on a change in the contextual information or content characteristics. The value is dynamically adjustable based on the change.
Abstract:
A method and apparatus are provided for motion estimation using combined reference bi-prediction. The apparatus includes an encoder (200) for encoding a multi-prediction picture from a combination of two or more reference pictures by respectively predicting a motion vector for a particular one of the two or more reference pictures in a motion estimation process while initializing motion vectors for remaining ones of the two or more reference pictures to a predefined value for use by the motion estimation process.