-
公开(公告)号:US10574997B2
公开(公告)日:2020-02-25
申请号:US15796313
申请日:2017-10-27
Applicant: Apple Inc.
Inventor: Chris Y. Chung , Mei Guo , Hsi-Jung Wu , Jingteng Xue , Jun Xin
IPC: H04N19/176 , H04N19/117 , H04N19/147 , H04N19/82 , H04N19/105 , H04N19/196
Abstract: Systems and methods are disclosed for coding pixel blocks of an input frame in which coding costs of a plurality of candidate coding modes are estimated. A coding cost of a candidate coding mode may be estimated based on noise estimate associated with the candidate coding mode. A coding mode for the input pixel block may be selected based on a comparison of the estimated coding costs of the plurality of candidate coding modes. The input pixel block may then be coded according to the selected coding mode.
-
32.
公开(公告)号:US20180091812A1
公开(公告)日:2018-03-29
申请号:US15275076
申请日:2016-09-23
Applicant: Apple Inc.
Inventor: Mei Guo , Jae Hoon Kim , Jun Xin , Feng Yi , Yeping Su , Dazhong Zhang , Chris Chung , Xiaosong Zhou , Hsi-Jung Wu
IPC: H04N19/117 , H04N19/86 , H04N19/14 , H04N19/15 , H04N19/61 , H04N19/124 , H04N19/13 , H04N19/159
CPC classification number: H04N19/117 , H04N19/124 , H04N19/13 , H04N19/134 , H04N19/136 , H04N19/14 , H04N19/15 , H04N19/159 , H04N19/174 , H04N19/176 , H04N19/182 , H04N19/184 , H04N19/186 , H04N19/61 , H04N19/82 , H04N19/86
Abstract: Techniques are disclosed for selecting deblocking filter parameters in a video decoding system. According to these techniques, a boundary strength parameter may be determined based, at least in part, on a bit depth of decoded video data. Activity of a pair of decoded pixel blocks may be classified based, at least in part, on the determined boundary strength parameter, and when a level of activity indicates that deblocking filtering is to be applied to the pair of pixel blocks, pixel block content at a boundary between the pair of pixel blocks may be filtered using filtering parameters derived at least in part based on the bit depth of the decoded video data. The filtering parameters may decrease strength with increasing bit depth of the decoded video data, which improves quality of the decoded video data.
-
公开(公告)号:US12267506B2
公开(公告)日:2025-04-01
申请号:US17837846
申请日:2022-06-10
Applicant: Apple Inc.
Inventor: Mei Guo , Jun Xin , Hsi-Jung Wu , Alexandros Tourapis , Krishnakanth Rapaka
IPC: H04N19/159 , H04N19/176 , H04N19/182
Abstract: Techniques are disclosed for deriving prediction pixel blocks for use in intra-coding video and combined inter- and intra-coding video. In a first aspect, the techniques may include deriving value(s) for pixel location(s) of the prediction pixel block by, when a prediction direction vector assigned to the prediction vector points to quadrants I or III of a Cartesian plane, deriving the pixel location's value from pixel values in two regions of previously-decoded pixel data intercepted by extending the prediction direction vector in two opposite directions through the pixel location. When the prediction direction vector points toward quadrants II of the Cartesian plane, deriving the pixel location's value from pixel values in one region intercepted by the prediction direction vector through the pixel location, and from a second region intercepted by a vector that is orthogonal to the prediction direction vector.
-
公开(公告)号:US12137199B2
公开(公告)日:2024-11-05
申请号:US18407198
申请日:2024-01-08
Applicant: Apple Inc.
Inventor: Fanyi Duanmu , Jun Xin , Hsi-Jung Wu , Xiaosong Zhou
IPC: H04N13/117 , H04N13/178 , H04N21/218 , H04N21/24
Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.
-
公开(公告)号:US20240048776A1
公开(公告)日:2024-02-08
申请号:US17956444
申请日:2022-09-29
Applicant: Apple Inc.
Inventor: Yixin Du , Alexandros Tourapis , Alican Nalci , Guoxin Jin , Hilmi Enes Egilmez , Hsi-Jung Wu , Jun Xin , Yeqing Wu , Yunfei Zheng
IPC: H04N19/86 , H04N19/117 , H04N19/139 , H04N19/176
CPC classification number: H04N19/86 , H04N19/117 , H04N19/139 , H04N19/176
Abstract: Disclosed is a method that includes receiving an image frame having a plurality of coded blocks, determining a prediction unit (PU) from the plurality of coded blocks, determining one or more motion compensation units arranged in an array within the PU, and applying a filter to one or more boundaries of the one or more motion compensation units. Also disclosed is a method that includes receiving a reference frame that includes a reference block, determining a timing for deblocking a current block, performing motion compensation on the reference frame to obtain a predicted frame that includes a predicted block, performing reconstruction on the predicted frame to obtain a reconstructed frame that includes a reconstructed PU, and applying, at the timing for deblocking the current block, a deblocking filter based on one or more parameters to the reference block, the predicted block, or the reconstructed PU.
-
公开(公告)号:US20240040151A1
公开(公告)日:2024-02-01
申请号:US18312077
申请日:2023-05-04
Applicant: Apple Inc.
Inventor: Alican Nalci , Yunfei Zheng , Hilmi E. Egilmez , Yeqing WU , Yixin Du , Alexis Tourapis , Jun Xin , Hsi-Jung Wu
IPC: H04N19/60 , H04N19/124 , H04N19/18 , H04N19/172
CPC classification number: H04N19/60 , H04N19/124 , H04N19/18 , H04N19/172
Abstract: Techniques are described for express and implied signaling of transform mode selections in video coding. Information derived from coefficient samples in a given transform unit (TU) or prediction unit (PU) may constrain or modify signaling of certain syntax elements at the coding block (CB), TU, or PU levels. For instance, based on the spatial locations of decoded coefficients, the spatial patterns of coefficients, or the correlation with the coefficients in neighboring blocks, various syntax elements such as the transform type and related flags/indices or secondary transform modes/flags indices, a residual coding mode, intra and inter prediction modes, and scanning order may be disabled or constrained. In another case, if the coefficient samples match a desired spatial pattern or have other desired properties then a default transform type, a default secondary transform type, a default intra and inter prediction mode or other block level modes may be inferred at the decoder side. Similar ideas may be extended to infer intra-block copy (IBC) flags/modes/indices from coefficient samples, or to infer flags and modes related to predictive coding techniques such as block-wise differential pulse code modulation (BDPCM), i.e., horizontal and vertical BDPCM modes, palette mode syntax and other block level flags/indices.
-
公开(公告)号:US20230117742A1
公开(公告)日:2023-04-20
申请号:US18083468
申请日:2022-12-16
Applicant: Apple Inc.
Inventor: Fanyi Duanmu , Jun Xin , Hsi-Jung Wu , Xiaosong Zhou
IPC: H04N13/117 , H04N21/218 , H04N21/24 , H04N13/178
Abstract: A system obtains a data set representing immersive video content for display at a display time, including first data representing the content according to a first level of detail, and second data representing the content according to a second higher level of detail. During one or more first times prior to the display time, the system causes at least a portion of the first data to be stored in a buffer. During one or more second times prior to the display time, the system generates a prediction of a viewport for displaying the content to a user at the display time, identifies a portion of the second data corresponding to the prediction of the viewport, and causes the identified portion of the second data to be stored in the video buffer. At the display time, the system causes the content to be displayed to the user using the video buffer.
-
38.
公开(公告)号:US11323730B2
公开(公告)日:2022-05-03
申请号:US16561478
申请日:2019-09-05
Applicant: Apple Inc.
Inventor: Mei Guo , Jun Xin , Yeping Su , Hsi-Jung Wu , Alexandros Tourapis
IPC: G01R31/34 , H02P29/024 , H01H71/70 , H04N19/44 , H04N19/159 , H04N19/177 , H04N19/172 , H04N19/46
Abstract: Techniques for video exchange are disclosed in which a video source sends redundant copies of video coded at a given presentation time. A decoder may determine, from metadata stored with a first decoded frame, whether other decoded frames have been designated as correlated to the first decoded frame. If so, the decoder may fetch other decoding frames using time indicator values of other decoded frames in metadata stored with the first decoded frame. When other decoded frame(s) are found, the decoder may blend content of the first coded frame and the other decoded frame(s). The decoder may render the blended frame. When one such coded frame is an intra-coded frame, the proposed technique may alleviate beating artifacts by blending recovered data of the intra-coded frame with recovered data of another frame, which may be inter-coded.
-
公开(公告)号:US11240492B2
公开(公告)日:2022-02-01
申请号:US16254528
申请日:2019-01-22
Applicant: Apple Inc.
Inventor: Jiefu Zhai , Xingyu Zhang , Xiaosong Zhou , Jun Xin , Hsi-Jung Wu , Yeping Su
IPC: H04N19/105 , H04N19/61 , H04N19/147 , H04N19/159 , H04N19/176 , G06N3/08
Abstract: Systems and methods disclosed for video compression, utilizing neural networks for predictive video coding. Processes employed combine multiple banks of neural networks with codec system components to carry out the coding and decoding of video data.
-
公开(公告)号:US10972753B1
公开(公告)日:2021-04-06
申请号:US16569725
申请日:2019-09-13
Applicant: Apple Inc.
Inventor: Fanyi Duanmu , Eduardo Asbun , Xiaosong Zhou , Jun Xin , Hsi-Jung Wu , John Su , Samir Gehani , Christopher Flick , Shalini Sahoo
IPC: H04N19/597 , H04N19/187 , H04N19/176
Abstract: Techniques are disclosed for coding and delivering multi-view video in which the video is represented as a manifest file identifying a plurality of segments of the video available for download. The multi-view video may be partitioned spatially into a plurality of tiles that, in aggregate, encompass the entire spatial area of the video. The tiles are coded as segments contains coded video representing content contained within its respective tile. Tiles may be given different sizes based on saliency of the content within their respective regions. In this manner, tiles with high levels of interest may have relatively large spatial areas, which can lead to efficient coding in the presence of content motion.
-
-
-
-
-
-
-
-
-