Abstract:
Methods for processing a set of successive video frames in two passes to determine the number of bidirectional (B) and unidirectional (P) motion to be encoded in a video coding system. During the first pass, motion vectors and motion costs are computed for each frame and a derived cost value is computed based on the motion cost of at least one frame. The derived cost value is used determine the number (N g ) of B-frames to be encoded in the set of successive frames. In the second pass, the set of successive frames are encoded where N B) frames are encoded as B-frames and some or all motion vectors computed in the first pass are reused in the second pass. A scene cut detection method is also provided where an impulse-like increase in a ration of motion costs is monitored
Abstract:
A rate control system is disclosed for video coding applications. The rate controller assigns a quantization parameter for video data in a picture in response to complexity indicators indicative of spatial complexity, motion complexity and/or bits per pel of the picture. A virtual buffer based quantizer parameter is proposed based on a virtual buffer fullness analysis and a target rate estimate, which is derived from the complexity indicators. A second quantizer parameter is proposed from a linear regression analysis of quantizer parameters used to code previously coded pictures of similar type (e.g., I pictures, P pictures or B pictures). A coding policy decision unit defines a final quantizer parameter from a comparison of the two proposed quantizer parameters.
Abstract:
A method and apparatus for variable accuracy inter-picture timing specification for digital video encoding is disclosed. Specificallly the present invention discloses a system that allows the relative timing of nearby video pictures to be encoded in a very efficient manner. In one embodiment, the display time difference between a current video picture (105) and a nearby video picture is determined. The display time difference is then encoded (180) into a digital representation of the video picture. In a preferred embodiment, the nearby video picture is the most recently transmitted stored picture. For coding efficiency, the display time difference may be encoded using a variable length coding system or arithmetic coding. In an alternate embodiment, the display time difference is encoded as a power of two to reduce the number of bits transmitted.
Abstract:
In this disclosure, a novel method for direct mode enhancement in B-pictures and skip mode enhancement in P-pictures in the framework of H.264 (MPEG-4/Part 10) is disclosed. Direct mode and skip mode enhancements are achieved by clustering the values of the Lagrangian, removing outliers and specifying smaller values of the Lagrangian multiplier in the rate-distortion optimization for encoding mode selection. Experimental results using high quality video sequences show that bit rate reduction is obtained using the method of the present invention, at the expense of a slight loss in peak signal-to-noise ratio (PSNR). By conducting two different experiments, it has been verified that no subjective visual loss is visible despite the peak signal-to-noise ratio change. In relationship to the existing ratedistortion optimization methods currently employed in the (non-normative) MPEG4/Part 10 encoder, the method of the present invention represents a simple and useful add-on. More importantly, when other solutions such as further increasing the values of the quantization parameter are not applicable, as inadmissible artifacts would be introduced in the decoded pictures, the method of the present invention achieves bit rate reduction without introducing visible distortion in the decoded sequences. Even more, despite the fact that the present document makes use of the H.264 framework, the proposed method is applicable in any video encoding system that employs ratedistortion optimization for encoding mode selection.
Abstract translation:在本公开中,公开了一种用于H.264(MPEG-4 / Part 10)的框架中的P图像中的B图像和跳过模式增强中的直接模式增强的新颖方法。 直接模式和跳过模式增强是通过对拉格朗日数值进行聚类来实现的,该值是在编码模式选择的速率失真优化中去除异常值并指定较小的拉格朗日乘数值。 使用高质量视频序列的实验结果表明,使用本发明的方法可以以峰值信噪比(PSNR)的轻微损失为代价来获得比特率降低。 通过进行两个不同的实验,已经证实,尽管峰值信噪比变化,没有主观视觉损失是可见的。 与目前在(非规范)MPEG4 / Part 10编码器中使用的现有额定优化方法有关,本发明的方法代表了一种简单有用的附加组件。 更为重要的是,当进一步增加量化参数值的其他解决方案不适用时,由于不可接受的伪像将被引入到解码图像中,所以本发明的方法实现比特率降低而不引入解码序列中的可见失真。 更重要的是,尽管本文件使用了H.264框架,但是所提出的方法可应用于采用额定优化来进行编码模式选择的任何视频编码系统。
Abstract:
In this disclosure, a novel method for direct mode enhancement in B-pictures and skip mode enhancement in P-pictures in the framework of H.264 (MPEG-4/Part 10) is disclosed. Direct mode and skip mode enhancements are achieved by clustering the values of the Lagrangian, removing outliers and specifying smaller values of the Lagrangian multiplier in the rate-distortion optimization for encoding mode selection. Experimental results using high quality video sequences show that bit rate reduction is obtained using the method of the present invention, at the expense of a slight loss in peak signal-to-noise ratio (PSNR). By conducting two different experiments, it has been verified that no subjective visual loss is visible despite the peak signal-to-noise ratio change. In relationship to the existing ratedistortion optimization methods currently employed in the (non-normative) MPEG4/Part 10 encoder, the method of the present invention represents a simple and useful add-on. More importantly, when other solutions such as further increasing the values of the quantization parameter are not applicable, as inadmissible artifacts would be introduced in the decoded pictures, the method of the present invention achieves bit rate reduction without introducing visible distortion in the decoded sequences. Even more, despite the fact that the present document makes use of the H.264 framework, the proposed method is applicable in any video encoding system that employs ratedistortion optimization for encoding mode selection.
Abstract:
A Method And Apparatus For Control of Rate-Distortion Tradeoff by Mode Selection in Video Encoders is Disclosed. The system of the present invention first selects a distortion value D near a desired distortion value. Next, the system determines a quantizer value Q using the selected distortion value D. The system then calculates a Lagrange multiplier lambda using the quantizer value Q. Using the selected Lagrange multiplier lambda and quantizer value Q, the system begins encoding pixelblocks. If the system detects a potential buffer overflow, then the system will increase the Lagrange multiplier lambda. If the Lagrange multiplier lambda exceeds a maximum lambda threshold then the system will increase the quantizer value Q. If the system detects a potential buffer underflow, then the system will decrease the Lagrange multiplier lambda. If the Lagrange multiplier lambda falls below a minimum lambda threshold then the system will decrease the quantizer value Q.
Abstract:
Methods for processing a set of successive video frames in two passes to determine the number of bidirectional (B) and unidirectional (P) motion to be encoded in a video coding system. During the first pass, motion vectors and motion costs are computed for each frame and a derived cost value is computed based on the motion cost of at least one frame. The derived cost value is used determine the number (Ng) of B-frames to be encoded in the set of successive frames. In the second pass, the set of successive frames are encoded where NB) frames are encoded as B-frames and some or all motion vectors computed in the first pass are reused in the second pass. A scene cut detection method is also provided where an impulse-like increase in a ration of motion costs is monitored
Abstract:
A method and apparatus for variable accuracy inter-picture timing specification for digital video encoding is disclosed. Specificallly the present invention discloses a system that allows the relative timing of nearby video pictures to be encoded in a very efficient manner. In one embodiment, the display time difference between a current video picture (105) and a nearby video picture is determined. The display time difference is then encoded (180) into a digital representation of the video picture. In a preferred embodiment, the nearby video picture is the most recently transmitted stored picture. For coding efficiency, the display time difference may be encoded using a variable length coding system or arithmetic coding. In an alternate embodiment, the display time difference is encoded as a power of two to reduce the number of bits transmitted.
Abstract:
A rate control system is disclosed for video coding applications. The rate controller assigns a quantization parameter for video data in a picture in response to complexity indicators indicative of spatial complexity, motion complexity and/or bits per pel of the picture. A virtual buffer based quantizer parameter is proposed based on a virtual buffer fullness analysis and a target rate estimate, which is derived from the complexity indicators. A second quantizer parameter is proposed from a linear regression analysis of quantizer parameters used to code previously coded pictures of similar type (e.g., I pictures, P pictures or B pictures). A coding policy decision unit defines a final quantizer parameter from a comparison of the two proposed quantizer parameters.