Abstract:
Provided are apparatuses and methods for reducing channel switching time in a network environment. The user of a mobile terminal may receive a data stream containing numerous television programs or services. Each of the programs or services may have a Packet Identification (PID) and/or IP address. The invention reduces channel switching time by predicting channel changing behavior. Various procedures are presented in which data is received and cached. The cached data may then be used to reduce the channel switching time during synchronizing and decoding of the new channel.
Abstract:
A method for signaling ROI scalability information in a file format. The present invention provides an efficient signaling of ROI scalability information in the file format, wherein the signaling comprises providing the geometrical information of a ROI and an indication to identify the ROI each coded data unit is associated with within a tier or layer.
Abstract:
The time until a media decoder is refreshed to produce correct output samples is minimized if an IDR picture is the first picture in a decoding order in each MPE-FEC frame. A system enabling minimization of tune-in delay is described. Aspects of the system include a content encoder for receiving a source signal including first and second bit streams and for encoding the first and second bit streams, the encoded first bit stream including a decoder refresh bit stream and the encoded second bit stream including a media bit stream. A server receives the encoded first and second bit streams and encapsulates the encoded first and second bit streams into packets. An encapsulator then receives the packets and composes a time-sliced multi-protocol encapsultation (MPE) stream by replacing a spliceable picture from the encoded second bit stream.
Abstract:
The use of a picture delimiter that is contained in a NAL unit type value that is reserved in the current AVC or SVC specification. The present invention provides the scalability information for the H.264/AVC base layer in such a manner that bitstreams remain decodable with H.264/AVC decoders. In addition, the picture delimiter of the present invention may contain many other syntax elements that can help in easier processing of bitstreams compared to the plain H.264/AVC bitstream syntax.
Abstract:
A method and system for compressed domain video mixing for spatially combining incoming video streams into an outgoing video stream. Using H.264 as an example, each incoming stream is divided into a plurality of slices, each having a plurality of header fields including a first mb in slice header field. Based on the picture format in the outgoing stream, first mb in slice for each incoming stream is modified such that the modified first mb in slice header field is indicative of location in the spatial representation of the outgoing stream at which the slice of the incoming stream is placed. H.264's slice group mechanism is used to map the spatial positions of the second and following macroblocks of the slices to the appropriate locations. If the incoming streams are previously mixed by upstream mixers, a decomposer can be used to separate these mixed streams into component streams before combining them with other incoming streams.
Abstract:
A system and method for conveying information that is helpful for a network middlebox or a media player to decided which coded data units to forward or process within an RTP payload or a file format data unit in an easy-to-access manner. This mechanism can be used to provide indications of items such as redundant coded pictures, temporal level switching points, gradual decoding refresh access points, view identifiers, and view random access points. A middlebox and/or receiver can then use this information to determine whether certain coded data units need to be processed and/or transmitted.
Abstract:
A system and method by which instantaneous decoding refresh (IDR)/intra pictures that enable one to tune in or randomly access a media stream are included within a "normal" bitstream as redundant coded pictures. In various embodiments, each intra picture for tune-in is provided as a redundant coded picture, in addition to the corresponding primary inter-coded picture.
Abstract:
A system and method for separating the data needed for inter-layer prediction and data unneeded for inter-layer prediction in the bitstream. For the coded data of a picture, the decoding of the data needed for inter-layer prediction is performed independent of the data not needed for inter-layer prediction, and it is identified whether the data is needed for inter-layer prediction.
Abstract:
A device, an encoding method, and a decoding method enable a separate marking of base representations and enhanced representations of key access units to save memory or to allow a better scalable video coding optimization. The encoding method of a sequence of original pictures to a sequence of access units includes, after encoding of one of the access units, storing a first decoded picture of the first encoded picture and a second decoded picture of the second encoded picture for inter prediction for encoding of others of the access units; and identifying the first decoded picture and the second decoded picture to be no longer used for inter prediction. The decoding method includes decoding the first access unit, where a first decoded picture is decoded from the first picture and a second decoded picture is decoded from the second picture; marking the first and second decoded pictures as used for inter prediction; decoding the second access unit; and marking one of the first and second decoded pictures as no longer used for inter prediction.
Abstract:
A system and method for signaling low-to-high layer switching points in a file format level to enable efficient scalable stream switching in streaming servers and local file playback. The present invention also provides for a system and method for signaling low-to-high layer switching points in video bit stream, e.g., to enable intelligent forwarding of scalability layers in media-aware network elements or computationally scalable decoding in stream recipients.