IMAGE GENERATION METHOD, APPARATUS, ELECTRONIC DEVICE, AND STORAGE MEDIUM

    公开(公告)号:US20250157150A1

    公开(公告)日:2025-05-15

    申请号:US18941402

    申请日:2024-11-08

    Applicant: Lemon Inc.

    Abstract: Embodiments of the present disclosure disclose an image generation method, an apparatus, an electronic device, and a storage medium. The method includes: determining three-dimensional representations of preset areas in a target object according to a noise vector, wherein the three-dimensional representations are used to represent features of points in a space, and the preset areas have different size percentages in the target object; determining a three-dimensional mesh model in a target posture according to posture control parameters of the preset areas; sampling corresponding areas in the three-dimensional mesh model respectively according to camera poses for the preset areas, to obtain sampling points corresponding to the preset areas; determining target features corresponding to the sampling points according to the three-dimensional representations of the preset areas; and rendering the preset areas according to the target features, to generate target images, wherein the target images contain the target object in the target posture.

    MULTI-DIMENSIONAL GENERATIVE FRAMEWORK FOR VIDEO GENERATION

    公开(公告)号:US20240193412A1

    公开(公告)日:2024-06-13

    申请号:US18063843

    申请日:2022-12-09

    Applicant: Lemon Inc.

    CPC classification number: G06N3/08 G06T2207/20081

    Abstract: Generating a multi-dimensional video using a multi-dimensional video generative model for, including, but not limited to, at least one of static portrait animation, video reconstruction, or motion editing. The method including providing data into the multi-dimensionally aware generator of the multi-dimensional video generative model, and generating the multi-dimensional video from the data by the multi-dimensionally aware generator. The generating of the multi-dimensional video includes inverting the data into a latent space of the multi-dimensionally aware generator, synthesizing content of the multi-dimensional video using an appearance component of the multi-dimensionally aware generator and corresponding camera pose and formulating an intermediate appearance code, developing a synthesis layer for encoding a motion component of the multi-dimensionally aware generator at a plurality of timesteps and formulating an intermediate motion code, introducing temporal dynamics into the intermediate appearance code and the intermediate motion code, and generating multi-dimensionally aware spatio-temporal representations of the data.

Patent Agency Ranking