-
公开(公告)号:US12267496B2
公开(公告)日:2025-04-01
申请号:US18358111
申请日:2023-07-25
Applicant: Apple Inc.
Inventor: Hilmi Enes Egilmez , Yunfei Zheng , Alican Nalci , Yeqing Wu , Yixin Du , Guoxin Jin , Alexandros Tourapis , Jun Xin , Hsi-Jung Wu
IPC: H04N7/12 , H04N19/12 , H04N19/122 , H04N19/136 , H04N19/159 , H04N19/176 , H04N19/18 , H04N19/186 , H04N19/61 , H04N19/70
Abstract: Video coders and decoders perform transform coding and decoding on blocks of video content according to an adaptively selected transform type. The transform types are organized into a hierarchy of transform sets where each transform set includes a respective number of transforms and each higher-level transform set includes the transforms of each lower-level transform set within the hierarchy. The video coders and video decoders may exchange signaling that establishes a transform set context from which a transform set that was selected for coding given block(s) may be identified. The video coders and video decoders may exchange signaling that establishes a transform decoding context from which a transform that was selected from the identified transform set to be used for decoding the transform unit. The block(s) may be coded and decoded by the selected transform.
-
公开(公告)号:US12192440B2
公开(公告)日:2025-01-07
申请号:US17568266
申请日:2022-01-04
Applicant: Apple Inc.
Inventor: Jiefu Zhai , Xingyu Zhang , Xiaosong Zhou , Jun Xin , Hsi-Jung Wu , Yeping Su
IPC: H04N19/105 , G06N3/08 , H04N19/147 , H04N19/159 , H04N19/176 , H04N19/61
Abstract: Systems and methods disclosed for video compression, utilizing neural networks for predictive video coding. Processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.
-
公开(公告)号:US12149699B2
公开(公告)日:2024-11-19
申请号:US17492789
申请日:2021-10-04
Applicant: Apple Inc.
Inventor: Yeping Su , Xingyu Zhang , Chris Chung , Jun Xin , Hsi-Jung Wu
IPC: H04N19/136 , G06N20/00 , H04N19/103 , H04N19/179 , H04N21/2343 , H04N21/845
Abstract: Techniques are disclosed by which a coding parameter is determined to encode video data resulting in encoded video data possessing a highest possible video quality. Features may be extracted from an input video sequence. The extracted features may be compared to features described in a model of coding parameters generated by a machine learning algorithm from reviews of previously-coded videos, extracted features of the previously-coded videos, and coding parameters of the previously-coded videos. When a match is detected between the extracted features of the input video sequence and extracted features represented in the model, a determination may be made as to whether coding parameters that correspond to the matching extracted feature correspond to a tier of service to which the input video sequence is to be coded. When the coding parameters that correspond to the matching extracted feature correspond to the tier of service to which the input video sequence is to be coded, the input video sequence may be coded according to the coding parameters.
-
公开(公告)号:US20240195978A1
公开(公告)日:2024-06-13
申请号:US18065220
申请日:2022-12-13
Applicant: Apple Inc.
Inventor: Yeqing WU , Yunfei Zheng , Alexandros Tourapis , Yixin Du , Alican Nalci , Hilmi E. Egilmez , Guoxin Jin , Jun Xin , Hsi-Jung Wu
IPC: H04N19/137 , H04N19/105 , H04N19/172 , H04N19/176
CPC classification number: H04N19/137 , H04N19/105 , H04N19/172 , H04N19/176
Abstract: The present disclosure describes techniques for efficient coding of motion vectors developed for multi-hypothesis coding applications. According to these techniques, when coding hypotheses are developed, each having a motion vector identifying a source of prediction for a current pixel block, a motion vector for a first one of the coding hypotheses may be predicted from the motion vector of a second coding hypothesis. The first motion vector may be represented by coding a motion vector residual, which represents a difference between the developed motion vector for the first coding hypothesis and the predicted motion vector for the first coding hypothesis, and outputting the coded residual to a channel. In another embodiment, a motion vector residual may be generated for a motion vector of a first coding hypothesis, and the first motion vector and the motion vector residual may be used to predict a second motion vector and a predicted motion vector residual. The second hypothesis's motion vector may be coded as a difference between the motion vector, the predicted second motion vector, and the predicted motion vector residual. In a further embodiment, a single motion vector residual may be output for the motion vectors of two coding hypotheses representing a difference between the motion vector of one of the hypotheses and a predicted motion vector for that hypothesis.
-
公开(公告)号:US11570417B2
公开(公告)日:2023-01-31
申请号:US17326204
申请日:2021-05-20
Applicant: Apple Inc.
Inventor: Fanyi Duanmu , Jun Xin , Hsi-Jung Wu , Xiaosong Zhou
IPC: H04N13/00 , H04N21/24 , H04N21/218 , H04N13/117 , H04N13/178
Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.
-
公开(公告)号:US10200687B2
公开(公告)日:2019-02-05
申请号:US15612880
申请日:2017-06-02
Applicant: Apple Inc.
Inventor: Mei Guo , Jun Xin , Yeping Su , Chris Y. Chung , Xiaosong Zhou , Hsi-Jung Wu
IPC: H04N19/117 , H04N19/136 , H04N19/186 , H04N19/46 , H04N19/157 , H04N19/80
Abstract: Methods of sample adaptive offset (SAO) for high dynamic range (HDR) video compression are disclosed. For band offset (BO), the sample range of each component is unequally divided into a number of bands based on the distribution of samples, the electro-optical transfer function (EOTF) of HDR source and one perceptually uniform EOTF. Each component may have a different number of bands, different number of signaled bands, and different range value for signaled offsets in BO mode. For edge offset (EO), Cb and Cr may use separate syntax elements. In the sample classification for EO, the difference between the current sample and its neighbor is compared with a threshold to determine whether it is a local valley, a local peak, or corners. A distortion measure for HDR source images are also disclosed.
-
公开(公告)号:US20240323442A1
公开(公告)日:2024-09-26
申请号:US18603138
申请日:2024-03-12
Applicant: Apple Inc.
Inventor: Alican Nalci , Guoxin Jin , Hilmi Enes Egilmez , Hsi-Jung Wu , Jun Xin , Yixin Du , Yunfei Zheng , Yeqing Wu
IPC: H04N19/625 , H04N19/176 , H04N19/70
CPC classification number: H04N19/625 , H04N19/176 , H04N19/70
Abstract: In an example method, a decoder accesses a bitstream representing video content, and parses one or more flexible coefficient position (FCP) syntax from the bitstream, where the one or more FCP syntax indicate one or more index values. The decoder further determines side information representing one or more characteristics of an encoded portion of the video content. The decoder interprets the one or more FCP syntax based on the side information, including determining a coefficient position with respect to the encoded portion of the video content based on the one or more index values and the side information. The decoder decodes the encoded portion of the video content according to the coefficient position.
-
公开(公告)号:US20230188738A1
公开(公告)日:2023-06-15
申请号:US18076166
申请日:2022-12-06
Applicant: Apple Inc.
Inventor: Alican Nalci , Alexandros Tourapis , Hilmi Enes Egilmez , Hsi-Jung Wu , Jun Xin , Yeqing Wu , Yixin Du , Yunfei Zheng
IPC: H04N19/44 , H04N19/184
CPC classification number: H04N19/44 , H04N19/184
Abstract: In an example method, a decoder obtains a data stream representing video content. The video content is partitioned into one or more logical units, and each of the logical units is partitioned into one or more respective logical sub-units. The decoder determines that the data stream includes first data indicating that a first logical unit has been encoded according to a flexible skip coding scheme. In response, the decoder determines a first set of decoding parameters based on the first data, and decodes each of the logical sub-units of the first logical unit according to the first set of decoding parameters.
-
公开(公告)号:US20220377304A1
公开(公告)日:2022-11-24
申请号:US17326204
申请日:2021-05-20
Applicant: Apple Inc.
Inventor: Fanyi Duanmu , Jun Xin , Hsi-Jung Wu , Xiaosong Zhou
IPC: H04N13/117 , H04N21/218 , H04N13/178 , H04N21/24
Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.
-
公开(公告)号:US20220360814A1
公开(公告)日:2022-11-10
申请号:US17736881
申请日:2022-05-04
Applicant: Apple Inc.
Inventor: Yeqing Wu , Alexandros Tourapis , Yunfei Zheng , Hsi-Jung Wu , Jun Xin , Albert E. Keinath , Mei Guo , Alican Nalci
IPC: H04N19/52 , H04N19/176 , H04N19/105 , H04N19/159 , H04N19/124
Abstract: An encoder or decoder can perform enhanced motion vector prediction by receiving an input block of data for encoding or decoding and accessing stored motion information for at least one other block of data. Based on the stored motion information, the encoder or decoder can generate a list of one or more motion vector predictor candidates for the input block in accordance with an adaptive list construction order. The encoder or decoder can predict a motion vector for the input block based on at least one of the one or more motion vector predictor candidates.
-
-
-
-
-
-
-
-
-