Abstract:
In one example of the disclosure, a method of coding video data comprises coding video data using texture-first coding, and performing an NBDV derivation process for a block of the video data using a plurality of neighboring blocks. The NBDV derivation process comprises designating a motion vector associated with a neighboring block of the plurality of neighboring blocks coded with a block-based view synthesis prediction (BVSP) mode as an available disparity motion.
Abstract:
As part of a video encoding process or a video decoding process, a video coder may determine a first available disparity motion vector among spatial neighboring blocks of a current block of the video data. Furthermore, the video coder may shift a horizontal component of the first available disparity motion vector to derive a shifted disparity motion vector candidate (DSMV). The video coder may add the DSMV into a merge candidate list.
Abstract:
In one example, a video coder, such as a video encoder or decoder, is configured to code a value for a layer identifier in a slice header for a current slice in a current layer of multi-layer video data, and, when the value for the layer identifier is not equal to zero, code a first set of syntax elements in accordance with a base video coding standard, and code a second set of one or more syntax elements in accordance with an extension to the base video coding standard. The second set of syntax elements may include a syntax element representative of a position for an identifier of an inter-layer reference picture of a reference layer in a reference picture list, and the video coder may construct the reference picture list such that the identifier of the inter-layer reference picture is located in the determined position.
Abstract:
Techniques are described related to constructing reference picture lists. The reference picture lists may be constructed from reference picture subsets of a reference picture set. In some examples, the reference picture subsets may be ordered in a particular manner to form the reference picture lists.
Abstract:
Techniques are described related to performing random access starting from a random access point picture that is not an instantaneous decoder refresh picture. Some techniques are also related to reducing the amount of information that is signaled for long-term reference pictures of a reference picture set. Additional techniques are also related to decoded picture buffer management, such as removing decoded pictures based on a temporal identification value.
Abstract:
A block request streaming system provides for improvements in the user experience and bandwidth efficiency of such systems, typically using an ingestion system that generates data in a form to be served by a conventional file server (HTTP, FTP, or the like), wherein the ingestion system intakes content and prepares it as files or data elements to be served by the file server. The system might include controlling the sequence, timing and construction of block requests, time based indexing, variable block sizing, optimal block partitioning, control of random access point placement, including across multiple presentation versions, dynamically updating presentation data, and/or efficiently presenting live content and time shifting.
Abstract:
A video coding apparatus may be configured to utilize media extractors in a media extractor track that reference two or more non-consecutive network access layer (NAL) units of a separate track. An example apparatus includes a multiplexer to construct a first track including a video sample comprising NAL units, based on encoded video data, wherein the video sample is included in an access unit, construct a second track including an extractor that identifies at least first one of the NAL units in the video sample of the first track, and wherein the extractor identifies a second NAL unit of the access unit, wherein the first identified NAL unit and the second identified NAL unit are non-consecutive, and include the first track and the second track in a video file conforming at least in part to ISO base media file format. The identified NAL units may be in separate tracks.
Abstract:
Techniques for encapsulating video streams containing multiple coded views in a media file are described. In one example, a method includes parsing a track of multiview video data, wherein the track includes one or more views, including only one of a texture view and a depth view of a particular view. The method further includes parsing a track reference to determine a dependency of the track to a reference track. The track reference types include 'deps' that indicates that the track includes the depth view of the particular view and the reference track includes the texture view of the particular view, 'tref' that indicates that the track depends on the texture view of the particular view which is stored in the referenced track, and 'dref' that indicates that the track depends on the depth view of the particular view which is stored in the referenced track.
Abstract:
Techniques for encapsulating video streams containing multiple coded views in a media file are described herein. In one example, a method includes parsing a track of video data, wherein the track includes one or more views. The method further includes parsing information to determine whether the track includes only texture views, only depth views, or both texture and depth views. Another example method includes composing a track of video data, wherein the track includes one or more views and composing information that indicates whether the track includes only texture views, only depth views, or both texture and depth views.
Abstract:
In one example of the disclosure, a method of coding video data comprises coding video data using texture-first coding, and performing an NBDV derivation process for a block of the video data using a plurality of neighboring blocks. The NBDV derivation process comprises designating a motion vector associated with a neighboring block of the plurality of neighboring blocks coded with a block-based view synthesis prediction (BVSP) mode as an available disparity motion.