Abstract:
There is provided methods, apparatuses and computer program products for image processing in which a pair of images may be downsampled to lower resolution pair of images and further to obtain a disparity image representing estimated disparity between at least a subset of pixels in the pair of images. A confidence of the disparity estimation may be obtained and inserted into a confidence map. The disparity image and the confidence map may be filtered jointly to obtain a filtered disparity image and a filtered confidence map by using a spatial neighborhood of the pixel location. An estimated disparity distribution of the pair of images may be obtained through the filtered disparity image and the confidence map.
Abstract:
There is disclosed a method, an apparatus, a server, a client and a non-transitory computer readable medium comprising a computer program stored therein for video coding and decoding. Depth pictures from a plurality of viewing angles are projected into a single viewing angle, making it possible to have pixel-wise joint filtering to be applied to all projected depth values. This approach enables to suppress the noise in the depth map data and provides improved performance for a view synthesis.
Abstract:
A method comprises receiving a first sequence of access units and a second sequence of access units; decoding at least one access unit of the first sequence of access units; decoding a first decodable access unit of the second sequence of access units; determining whether a next decodable access unit in the second sequence of access units can be decoded before an output time of the next decodable access unit in the second sequence of access units; and skipping decoding of the next decodable access unit based on determining that the next decodable access unit cannot be decoded before the at least one of the decoding time and the output time of the next decodable access unit.
Abstract:
There is disclosed a method, apparatus and computer program product for adaptive streaming. At least one file comprising media data is generated, wherein a first segment and a second segment are received, and a first instruction and a second instruction are received. The first segment and the second segment are modified on the basis of the first instruction and the second instruction. The at least one file is created on the basis of the modified first segment and the modified second segment.
Abstract:
A method, apparatus, system and computer program product are provided to provide switching point information to facilitate switching between different representations of the media content. In an instance in which a content consumption device determines that a switch from a first representation to a second representation is merited, the content consumption device may identify the appropriate switching point from the switching point information provided by the server. The content consumption device may then request the second representation of the media content beginning at the switching point.
Abstract:
An apparatus comprising a controller configured to divide a section of an encoded multimedia signal into at least two segments depending on a time based decoding criteria; a generator configured to determine an error correction code for each of the at least two time segments; and a distributor configured to associate the error correction code for each of the at least two time segments with the section of the encoded multimedia signal and with a section of at least one further encoded multimedia signal.
Abstract:
In accordance with an example embodiment of the present invention, an apparatus comprising a processing unit configured to receive information related to available camera views of a three dimensional scene, request a synthetic view which is different from any available camera view and determined by the processing unit and receive media data comprising video data associated with the synthetic view.
Abstract:
In one aspect, a method includes receiving a multimedia data block. The multimedia data block includes a plurality of data parts. The method further includes interleaving the plurality of data parts to form an interleaved data block based on a priority order. The priority order based on a multimedia data type of the data parts. The method further includes transmitting the interleaved data block. In another aspect, a method includes receiving one or more data parts of an interleaved multimedia data block and determining a first data part of the one or more data parts based on information. The method further includes determining a schedule entry in a schedule for the first data part based on other information.
Abstract:
Joint coding of depth map video and texture video is provided, where a motion vector for a texture video is predicted from a respective motion vector of a depth map video or vice versa. For scalable video coding, depth map video is coded as a base layer and texture video is coded as an enhancement layer(s). Inter-layer motion prediction predicts motion in texture video from motion in depth map video. With more than one view in a bit stream (for multi view coding), depth map videos are considered monochromatic camera views and are predicted from each other. If joint multi-view video model coding tools are allowed, inter- view motion skip issued to predict motion vectors of texture images from depth map images. Furthermore, scalable multi-view coding is utilized, where interview prediction is applied between views in the same dependency layer, and inter-layer (motion) prediction is applied between layers in the same view.
Abstract:
Systems and methods for storing notification messages in an ISO base media file are provided, where different transport cases when notification messages are to be stored are addressed. The systems and methods enable the linking of notification message parts delivered over RTP with other parts of a notification message carried over file delivery over unidirectional transport (FLUTE) or some other protocol. Various implementations of the systems and methods can be generic and allow objects delivered out-of-band to be referenced from media and hint tracks. Additionally the lifecycle of notification objects can be reproduced in the file without timers required in the parsing of the file.