Resolution-based video encoding
    Granted invention patent

    Publication number: US12301825B2

    Publication date: 2025-05-13

    Application number: US18097165

    Filing date: 2023-01-13

    Applicant: Apple Inc.

    Abstract: Aspects of the subject technology relate to encoding of video frames having content with a variable resolution that varies within the video frame. Aspects of the subject technology can provide an efficient encoding by using larger macroblocks for lower resolution content within the video frame, and smaller macroblocks for higher resolution content within the video frame. An encoder may be provided with resolution information for the content of the video frame, which can be used by the encoder to determine macroblock sizes, macroblock divisions, and/or prediction modes for the encoding of the video frame.
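
    The idea of deriving block sizes from per-region resolution information can be illustrated with a minimal sketch. It is not the patented method: the function names, the candidate block sizes, and the even banding rule are all hypothetical, and the sketch assumes that coarser (lower resolution) content is given larger blocks while finer content is given smaller ones.

```python
def choose_block_size(resolution_scale, sizes=(8, 16, 32, 64)):
    """Map a relative resolution in (0, 1] to a block size (hypothetical rule).

    1.0 means full resolution; smaller values mean the region carries
    coarser content, which this sketch assumes can use a larger block.
    """
    if not 0.0 < resolution_scale <= 1.0:
        raise ValueError("resolution_scale must be in (0, 1]")
    # Split (0, 1] into len(sizes) even bands; the lowest band gets the largest block.
    band = min(int(resolution_scale * len(sizes)), len(sizes) - 1)
    return sizes[len(sizes) - 1 - band]


def block_size_map(resolution_map):
    """Apply choose_block_size to every region of a 2D resolution map."""
    return [[choose_block_size(scale) for scale in row] for row in resolution_map]


if __name__ == "__main__":
    # Hypothetical 2x2 map: full resolution at top-left, coarsest at bottom-right.
    print(block_size_map([[1.0, 0.5], [0.25, 0.1]]))
```

    A real encoder would also weigh rate-distortion cost and prediction modes; the lookup above only shows how resolution information could steer the initial block-size choice.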

    Real time simplification of meshes

    Publication number: US12256098B1

    Publication date: 2025-03-18

    Application number: US17691691

    Filing date: 2022-03-10

    Applicant: Apple Inc.

    Abstract: A decoding computing device receives a bit stream for compressed 3D volumetric content. The bit stream includes video encoded image frames comprising packed attribute patch images and depth maps for the 3D volumetric content. Instead of generating a mesh having a vertex for each depth value signaled in the depth map, the decoder performs a real-time mesh simplification process to reduce a resolution of the mesh, such that the mesh resolution is reduced without exceeding an error threshold, which may be dynamically determined. Additionally, the decoder may perform a re-meshing of particular regions of the mesh for the 3D volumetric content to avoid cracks or gaps.
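
    One way to picture error-bounded simplification of a depth-map mesh is a quadtree-style refinement, sketched below. This is a swapped-in textbook technique, not the process claimed in the patent: a square cell keeps only its four corner vertices when bilinear interpolation of those corners reproduces every depth sample inside the cell to within `max_error`, and is split into four otherwise. The function name and the requirement of a square map with side 2**k + 1 are assumptions made for brevity.

```python
def simplify_depth_map(depth, max_error):
    """Return the sorted (row, col) vertices kept by a quadtree-style refinement."""
    n = len(depth)
    if n < 2 or (n - 1) & (n - 2) or any(len(row) != n for row in depth):
        raise ValueError("expected a square map with side 2**k + 1")
    kept = set()

    def worst_error(r0, c0, r1, c1):
        # Largest gap between the samples and the bilinear patch through the corners.
        size = r1 - r0  # cells stay square, so this is also c1 - c0
        worst = 0.0
        for r in range(r0, r1 + 1):
            v = (r - r0) / size
            for c in range(c0, c1 + 1):
                u = (c - c0) / size
                top = depth[r0][c0] * (1 - u) + depth[r0][c1] * u
                bottom = depth[r1][c0] * (1 - u) + depth[r1][c1] * u
                worst = max(worst, abs(depth[r][c] - (top * (1 - v) + bottom * v)))
        return worst

    def refine(r0, c0, r1, c1):
        kept.update({(r0, c0), (r0, c1), (r1, c0), (r1, c1)})
        # Stop at single-sample cells or when the coarse patch is accurate enough.
        if r1 - r0 == 1 or worst_error(r0, c0, r1, c1) <= max_error:
            return
        rm, cm = (r0 + r1) // 2, (c0 + c1) // 2
        refine(r0, c0, rm, cm)
        refine(r0, cm, rm, c1)
        refine(rm, c0, r1, cm)
        refine(rm, cm, r1, c1)

    refine(0, 0, n - 1, n - 1)
    return sorted(kept)
```

    Neighbouring cells of different sizes meet at T-junctions, which is one source of the cracks the abstract mentions; this sketch does not perform the re-meshing that would close them.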

    Immersive media content encoding and rendering

    Publication number: US12299805B1

    Publication date: 2025-05-13

    Application number: US17804801

    Filing date: 2022-05-31

    Applicant: Apple Inc.

    Abstract: A decoding computing device determines a focus area of a view of volumetric visual content, such as a three-dimensional (3D) object or scene, and indicates the focus area in a request for compressed volumetric visual content. A server provides a depth map and an attribute atlas that includes attribute and/or texture information for the focus area that is signaled at a higher resolution than other portions of the volumetric visual content. The decoding device also applies one or more mesh simplification techniques, wherein a higher resolution mesh is used for the focus area. The textures/attributes are projected onto the mesh representations, wherein the mesh representations and projected textures are used to reconstruct the volumetric visual content. Hole filling techniques are then applied, wherein a more sophisticated hole filling technique is used in the focus area.

    Method and device for generating a 3D reconstruction of a scene with a hybrid camera rig

    Publication number: US12219118B1

    Publication date: 2025-02-04

    Application number: US17678815

    Filing date: 2022-02-23

    Applicant: Apple Inc.

    Abstract: In one implementation, a camera rig comprises: a first array of image sensors arranged in a planar configuration, wherein the first array of image sensors is provided to capture a first image stream from a first perspective of a physical environment; a second array of image sensors arranged in a non-planar configuration, wherein the second array of image sensors is provided to capture a second image stream from a second perspective of the physical environment different from the first perspective; a buffer provided to store the first and second image streams; and an image processing engine provided to generate a 3D reconstruction of the physical environment based on the first and second image streams.

    Viewport adaptive volumetric content streaming and/or rendering

    Publication number: US11418769B1

    Publication date: 2022-08-16

    Application number: US17222872

    Filing date: 2021-04-05

    Applicant: Apple Inc.

    Abstract: A system comprises an encoder configured to compress and encode data for three-dimensional volumetric content. The encoder is also configured to segment the three-dimensional volumetric content based on viewing areas, wherein different ones of the viewing areas correspond to visible portions of the volumetric content. The system may provide metadata to a client device to support viewport adaptive rendering of the three-dimensional volumetric content or may adaptively stream portions of the three-dimensional volumetric content to a rendering device based on viewing areas of the three-dimensional volumetric content that are to be rendered at the rendering device.
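
    The viewport-adaptive streaming described in this abstract can be pictured as a lookup from viewing area to visible segments. The sketch below assumes a hypothetical metadata layout (a dictionary from viewing-area name to segment identifiers) and hypothetical function names; the patent does not specify either.

```python
def visible_segments(metadata, viewing_area):
    """Segments marked as visible from a viewing area (empty if the area is unknown)."""
    return set(metadata.get(viewing_area, ()))


def segments_to_stream(metadata, viewing_area, cached=frozenset()):
    """Segments the client still needs for this viewing area, in a stable order."""
    return sorted(visible_segments(metadata, viewing_area) - set(cached))


if __name__ == "__main__":
    # Hypothetical metadata: two viewing areas that share one segment.
    metadata = {
        "front": ["seg_face", "seg_torso"],
        "back": ["seg_hair", "seg_torso"],
    }
    print(segments_to_stream(metadata, "front"))
    # Moving to the back view, only the segment not already cached is requested.
    print(segments_to_stream(metadata, "back", cached={"seg_face", "seg_torso"}))
```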

    Recording Content in a Head-mounted Device

    Publication number: US20240406364A1

    Publication date: 2024-12-05

    Application number: US18644222

    Filing date: 2024-04-24

    Applicant: Apple Inc.

    Abstract: A head-mounted device is provided that includes a variety of subsystems for generating extended reality content, displaying the extended reality content, and recording the extended reality content. The device may include a graphics rendering pipeline configured to render virtual content, tracking sensors configured to obtain user tracking information, a virtual content compositor configured to composite virtual frames based on the virtual content and the user tracking information, cameras configured to capture a video feed, a media merging compositor configured to overlay the composited virtual frames and the video feed, and a recording pipeline configured to record parameters, metadata, raw content, and/or adjusted content in an extended reality recording file. The extended reality recording file may have multiple discrete portions that may each be individually edited. The extended reality recording file may be used to present a replay on the head-mounted device and/or may be exported to an external device.

    Resolution budgeting by area for immersive video rendering

    Publication number: US11861788B1

    Publication date: 2024-01-02

    Application number: US17347404

    Filing date: 2021-06-14

    Applicant: Apple Inc.

    CPC classification number: G06T15/08, G06T7/50, G06T9/00, G06T2207/10016

    Abstract: One or more computing devices implement a mesh analysis for evaluating meshes to be rendered when rendering immersive content. The mesh analysis identifies objects in a three-dimensional scene and determines geometrical complexity values for the objects. Objects with similar geometrical complexities are grouped into areas and a mesh vertices budget is determined for the respective areas. Metadata indicating the area definitions and corresponding mesh vertices budgets are generated. The metadata may be uploaded to a server to simplify meshes in the scene prior to streaming to a client, or the metadata may be provided to a client for use in simplifying the meshes as part of rendering the scene.
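
    The grouping and budgeting steps in this abstract can be sketched as follows. The details are assumptions made for illustration, not taken from the patent: complexity is a single number per object, objects are grouped when their values lie within a `tolerance` of the lowest value in the group, and the total vertex budget is divided in proportion to each area's summed complexity.

```python
def group_by_complexity(complexities, tolerance):
    """Group object names whose complexity lies within `tolerance` of the group's minimum."""
    groups = []  # each entry: (lowest complexity in the group, [object names])
    for name, value in sorted(complexities.items(), key=lambda item: item[1]):
        if groups and value - groups[-1][0] <= tolerance:
            groups[-1][1].append(name)
        else:
            groups.append((value, [name]))
    return [names for _, names in groups]


def vertex_budgets(complexities, groups, total_budget):
    """Split a total vertex budget across areas in proportion to summed complexity."""
    weights = [sum(complexities[name] for name in group) for group in groups]
    total = sum(weights)
    if total <= 0:
        raise ValueError("complexities must sum to a positive value")
    return [int(total_budget * weight / total) for weight in weights]


if __name__ == "__main__":
    # Hypothetical scene with two simple and two detailed objects.
    scene = {"wall": 1, "floor": 2, "statue": 10, "tree": 11}
    areas = group_by_complexity(scene, tolerance=2)
    print(areas, vertex_budgets(scene, areas, total_budget=2400))
```

    The resulting area definitions and budgets correspond to the metadata the abstract describes, which could be consumed either by a server before streaming or by a client during rendering.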
