Semantic-aware initial latent code selection for text-guided image editing and generation

    公开(公告)号:US12254597B2

    公开(公告)日:2025-03-18

    申请号:US17709221

    申请日:2022-03-30

    Applicant: Adobe Inc.

    Abstract: An item recommendation system receives a set of recommendable items and a request to select, from the set of recommendable items, a contrast group. The item recommendation system selects a contrast group from the set of recommendable items by applying a image modification model to the set of recommendable items. The image modification model includes an item selection model configured to determine an unbiased conversion rate for each item of the set of recommendable items and select a recommended item from the set of recommendable items having a greatest unbiased conversion rate. The image modification model includes a contrast group selection model configured to select, for the recommended item, a contrast group comprising the recommended item and one or more contrast items. The item recommendation system transmits the contrast group responsive to the request.

    COMPLETE 3D OBJECT RECONSTRUCTION FROM AN INCOMPLETE IMAGE

    公开(公告)号:US20250078406A1

    公开(公告)日:2025-03-06

    申请号:US18242380

    申请日:2023-09-05

    Applicant: Adobe Inc.

    Abstract: A modeling system accesses a two-dimensional (2D) input image displayed via a user interface, the 2D input image depicting, at a first view, a first object. At least one region of the first object is not represented by pixel values of the 2D input image. The modeling system generates, by applying a 3D representation generation model to the 2D input image, a three-dimensional (3D) representation of the first object that depicts an entirety of the first object including the first region. The modeling system displays, via the user interface, the 3D representation, wherein the 3D representation is viewable via the user interface from a plurality of views including the first view.

    User-guided image generation
    3.
    发明授权

    公开(公告)号:US12230014B2

    公开(公告)日:2025-02-18

    申请号:US17680906

    申请日:2022-02-25

    Applicant: ADOBE INC.

    Abstract: An image generation system enables user input during the process of training a generative model to influence the model's ability to generate new images with desired visual features. A source generative model for a source domain is fine-tuned using training images in a target domain to provide an adapted generative model for the target domain. Interpretable factors are determined for the source generative model and the adapted generative model. A user interface is provided that enables a user to select one or more interpretable factors. The user-selected interpretable factor(s) are used to generate a user-adapted generative model, for instance, by using a loss function based on the user-selected interpretable factor(s). The user-adapted generative model can be used to create new images in the target domain.

    DIGITAL IMAGE REPOSING BASED ON MULTIPLE INPUT VIEWS

    公开(公告)号:US20250005812A1

    公开(公告)日:2025-01-02

    申请号:US18215484

    申请日:2023-06-28

    Applicant: Adobe Inc.

    Abstract: In implementations of systems for human reposing based on multiple input views, a computing device implements a reposing system to receive input data describing: input digital images; pluralities of keypoints corresponding to the input digital images, the pluralities of keypoints representing poses of a person depicted in the input digital images; and a plurality of keypoints representing a target pose. The reposing system generates selection masks corresponding to the input digital images by processing the input data using a machine learning model. The selection masks represent likelihoods of spatial correspondence between pixels of an output digital image and portions of the input digital images. The reposing system generates the output digital image depicting the person in the target pose for display in a user interface based on the selection masks and the input data.

    DIGITAL IMAGE REPOSING TECHNIQUES

    公开(公告)号:US20240428564A1

    公开(公告)日:2024-12-26

    申请号:US18213118

    申请日:2023-06-22

    Applicant: Adobe Inc.

    Abstract: In implementations of systems for generating images for human reposing, a computing device implements a reposing system to receive input data describing an input digital image depicting a person in a first pose, a first plurality of keypoints representing the first pose, and a second plurality of keypoints representing a second pose. The reposing system generates a mapping by processing the input data using a first machine learning model. The mapping indicates a plurality of first portions of the person in the second pose that are visible in the input digital image and a plurality of second portions of the person in the second pose that are invisible in the input digital image. The reposing system generates an output digital image depicting the person in the second pose by processing the mapping, the first plurality of keypoints, and the second plurality of keypoints using a second machine learning model.

    GENERATING THREE-DIMENSIONAL LOOPING ANIMATIONS FROM STILL IMAGES

    公开(公告)号:US20240428491A1

    公开(公告)日:2024-12-26

    申请号:US18340445

    申请日:2023-06-23

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to a system that utilizes neural networks to generate looping animations from still images. The system fits a 3D model to a pose of a person in a digital image. The system receives a 3D animation sequence that transitions between a starting pose and an ending pose. The system generates, utilizing an animation transition neural network, first and second 3D animation transition sequences that respectively transition between the pose of the person and the starting pose and between the ending pose and the pose of the person. The system modifies each of the 3D animation sequence, the first 3D animation transition sequence, and the second 3D animation transition sequence by applying a texture map. The system generates a looping 3D animation by combining the modified 3D animation sequence, the modified first 3D animation transition sequence, and the modified second 3D animation transition sequence.

    HIGH-RESOLUTION IMAGE GENERATION
    8.
    发明公开

    公开(公告)号:US20240320789A1

    公开(公告)日:2024-09-26

    申请号:US18585957

    申请日:2024-02-23

    Applicant: ADOBE INC.

    CPC classification number: G06T3/4053 G06T3/4046 G06T11/00

    Abstract: A method, non-transitory computer readable medium, apparatus, and system for image generation include obtaining an input image having a first resolution, where the input image includes random noise, and generating a low-resolution image based on the input image, where the low-resolution image has the first resolution. The method, non-transitory computer readable medium, apparatus, and system further include generating a high-resolution image based on the low-resolution image, where the high-resolution image has a second resolution that is greater than the first resolution.

    Few-shot image generation via self-adaptation

    公开(公告)号:US11880957B2

    公开(公告)日:2024-01-23

    申请号:US17013332

    申请日:2020-09-04

    Applicant: Adobe Inc.

    CPC classification number: G06T3/0056 G06N20/00 G06T11/00 G06T2207/20081

    Abstract: One example method involves operations for receiving a request to transform an input image into a target image. Operations further include providing the input image to a machine learning model trained to adapt images. Training the machine learning model includes accessing training data having a source domain of images and a target domain of images with a target style. Training further includes using a pre-trained generative model to generate an adapted source domain of adapted images having the target style. The adapted source domain is generated by determining a rate of change for parameters of the target style, generating weighted parameters by applying a weight to each of the parameters based on their respective rate of change, and applying the weighted parameters to the source domain. Additionally, operations include using the machine learning model to generate the target image by modifying parameters of the input image using the target style.

Patent Agency Ranking