-
公开(公告)号:US11907839B2
公开(公告)日:2024-02-20
申请号:US17468511
申请日:2021-09-07
Applicant: Adobe Inc.
Inventor: Ratheesh Kalarot , Kevin Wampler , Jingwan Lu , Jakub Fiser , Elya Shechtman , Aliakbar Darabi , Alexandru Vasile Costin
IPC: G06N3/08 , G06F3/04845 , G06T11/60 , G06T3/40 , G06T3/00 , G06F3/04847 , G06N20/20 , G06T5/00 , G06T5/20 , G06T11/00 , G06F18/40 , G06F18/211 , G06F18/214 , G06F18/21 , G06N3/045
CPC classification number: G06N3/08 , G06F3/04845 , G06F3/04847 , G06F18/211 , G06F18/214 , G06F18/2163 , G06F18/40 , G06N3/045 , G06N20/20 , G06T3/0006 , G06T3/0093 , G06T3/40 , G06T3/4038 , G06T3/4046 , G06T5/005 , G06T5/20 , G06T11/001 , G06T11/60 , G06T2207/10024 , G06T2207/20081 , G06T2207/20084 , G06T2207/20221 , G06T2210/22
Abstract: Systems and methods combine an input image with an edited image generated using a generator neural network to preserve detail from the original image. A computing system provides an input image to a machine learning model to generate a latent space representation of the input image. The system provides the latent space representation to a generator neural network to generate a generated image. The system generates multiple scale representations of the input image, as well as multiple scale representations of the generated image. The system generates a first combined image based on first scale representations of the images and a first value. The system generates a second combined image based on second scale representations of the images and a second value. The system blends the first combined image with the second combined image to generate an output image.
-
公开(公告)号:US11875221B2
公开(公告)日:2024-01-16
申请号:US17468476
申请日:2021-09-07
Applicant: Adobe Inc.
Inventor: Wei-An Lin , Baldo Faieta , Cameron Smith , Elya Shechtman , Jingwan Lu , Jun-Yan Zhu , Niloy Mitra , Ratheesh Kalarot , Richard Zhang , Shabnam Ghadar , Zhixin Shu
IPC: G06N3/08 , G06F3/04845 , G06F3/04847 , G06T11/60 , G06T3/40 , G06N20/20 , G06T5/00 , G06T5/20 , G06T3/00 , G06T11/00 , G06F18/40 , G06F18/211 , G06F18/214 , G06F18/21 , G06N3/045
CPC classification number: G06N3/08 , G06F3/04845 , G06F3/04847 , G06F18/211 , G06F18/214 , G06F18/2163 , G06F18/40 , G06N3/045 , G06N20/20 , G06T3/0006 , G06T3/0093 , G06T3/40 , G06T3/4038 , G06T3/4046 , G06T5/005 , G06T5/20 , G06T11/001 , G06T11/60 , G06T2207/10024 , G06T2207/20081 , G06T2207/20084 , G06T2207/20221 , G06T2210/22
Abstract: Systems and methods generate a filtering function for editing an image with reduced attribute correlation. An image editing system groups training data into bins according to a distribution of a target attribute. For each bin, the system samples a subset of the training data based on a pre-determined target distribution of a set of additional attributes in the training data. The system identifies a direction in the sampled training data corresponding to the distribution of the target attribute to generate a filtering vector for modifying the target attribute in an input image, obtains a latent space representation of an input image, applies the filtering vector to the latent space representation of the input image to generate a filtered latent space representation of the input image, and provides the filtered latent space representation as input to a neural network to generate an output image with a modification to the target attribute.
-
公开(公告)号:US11853348B2
公开(公告)日:2023-12-26
申请号:US16910440
申请日:2020-06-24
Applicant: Adobe Inc.
Inventor: Akhilesh Kumar , Zhe Lin , Ratheesh Kalarot , Jinrong Xie , Jianming Zhang , Baldo Antonio Faieta , Alex Charles Filipkowski
IPC: G06F16/532 , G06F16/583 , G06F16/55 , G06F16/538 , G06N3/02 , G06N20/20
CPC classification number: G06F16/532 , G06F16/538 , G06F16/55 , G06F16/583 , G06N3/02 , G06N20/20
Abstract: Multidimensional digital content search techniques are described that support an ability of a computing device to perform search with increased granularity and flexibility over conventional techniques. In one example, a control is implemented by a computing device that defines a multidimensional (e.g., two-dimensional) continuous space. Locations in the multidimensional continuous space are usable to different search criteria through different weights applied to the criteria associated with the axes. Therefore, user interaction with this control may be used to define a location and corresponding coordinates that may act as weights to the search criteria in order to perform a search of digital content through use of a single user input.
-
公开(公告)号:US20230162407A1
公开(公告)日:2023-05-25
申请号:US17455796
申请日:2021-11-19
Applicant: ADOBE INC.
Inventor: Ratheesh Kalarot , Timothy M. Converse , Shabnam Ghadar , John Thomas Nack , Jingwan Lu , Elya Shechtman , Baldo Faieta , Akhilesh Kumar
CPC classification number: G06T11/00 , G06K9/00288 , G06K9/00268 , G06N3/08
Abstract: The present disclosure describes systems and methods for image processing. Embodiments of the present disclosure include an image processing apparatus configured to generate modified images (e.g., synthetic faces) by conditionally changing attributes or landmarks of an input image. A machine learning model of the image processing apparatus encodes the input image to obtain a joint conditional vector that represents attributes and landmarks of the input image in a vector space. The joint conditional vector is then modified, according to the techniques described herein, to form a latent vector used to generate a modified image. In some cases, the machine learning model is trained using a generative adversarial network (GAN) with a normalization technique, followed by joint training of a landmark embedding and attribute embedding (e.g., to reduce inference time).
-
公开(公告)号:US20220122307A1
公开(公告)日:2022-04-21
申请号:US17468511
申请日:2021-09-07
Applicant: Adobe Inc.
Inventor: Ratheesh Kalarot , Kevin Wampler , Jingwan Lu , Jakub Fiser , Elya Shechtman , Aliakbar Darabi , Alexandru Vasile Costin
Abstract: Systems and methods combine an input image with an edited image generated using a generator neural network to preserve detail from the original image. A computing system provides an input image to a machine learning model to generate a latent space representation of the input image. The system provides the latent space representation to a generator neural network to generate a generated image. The system generates multiple scale representations of the input image, as well as multiple scale representations of the generated image. The system generates a first combined image based on first scale representations of the images and a first value. The system generates a second combined image based on second scale representations of the images and a second value. The system blends the first combined image with the second combined image to generate an output image.
-
6.
公开(公告)号:US12211178B2
公开(公告)日:2025-01-28
申请号:US17660090
申请日:2022-04-21
Applicant: Adobe Inc.
Inventor: Tobias Hinz , Shabnam Ghadar , Richard Zhang , Ratheesh Kalarot , Jingwan Lu , Elya Shechtman
Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for combining digital images. In particular, in one or more embodiments, the disclosed systems combine latent codes of a source digital image and a target digital image utilizing a blending network to determine a combined latent encoding and generate a combined digital image from the combined latent encoding utilizing a generative neural network. In some embodiments, the disclosed systems determine an intersection face mask between the source digital image and the combined digital image utilizing a face segmentation network and combine the source digital image and the combined digital image utilizing the intersection face mask to generate a blended digital image.
-
公开(公告)号:US20240169499A1
公开(公告)日:2024-05-23
申请号:US18057930
申请日:2022-11-22
Applicant: ADOBE INC.
Inventor: Anjali Agarwal , Siavash Khodadadeh , Ratheesh Kalarot , Hui Qu , Sven C. Olsen , Shabnam Ghadar
CPC classification number: G06T5/005 , G06N3/0454 , G06T3/4046 , G06T3/4053 , G06T2207/20016 , G06T2207/20081 , G06T2207/20084 , G06T2207/30201
Abstract: Systems and methods for image processing are provided. Embodiments include identifying an image of a face that includes an artifact in a part of the face. A machine learning model generates an intermediate image based on the original image. The intermediate image depicts the part of the face in a closed position. Then the model generates a corrected image based on the intermediate image. The corrected image depicts the face with the part of the face in an open position and without the artifact.
-
公开(公告)号:US11983628B2
公开(公告)日:2024-05-14
申请号:US17468487
申请日:2021-09-07
Applicant: Adobe Inc.
Inventor: Wei-An Lin , Baldo Faieta , Cameron Smith , Elya Shechtman , Jingwan Lu , Jun-Yan Zhu , Niloy Mitra , Ratheesh Kalarot , Richard Zhang , Shabnam Ghadar , Zhixin Shu
IPC: G06N3/08 , G06F3/04845 , G06F3/04847 , G06F18/21 , G06F18/211 , G06F18/214 , G06F18/40 , G06N3/045 , G06N20/20 , G06T3/02 , G06T3/18 , G06T3/40 , G06T3/4038 , G06T3/4046 , G06T5/20 , G06T5/77 , G06T11/00 , G06T11/60
CPC classification number: G06N3/08 , G06F3/04845 , G06F3/04847 , G06F18/211 , G06F18/214 , G06F18/2163 , G06F18/40 , G06N3/045 , G06N20/20 , G06T3/02 , G06T3/18 , G06T3/40 , G06T3/4038 , G06T3/4046 , G06T5/20 , G06T5/77 , G06T11/001 , G06T11/60 , G06T2207/10024 , G06T2207/20081 , G06T2207/20084 , G06T2207/20221 , G06T2210/22
Abstract: Systems and methods dynamically adjust an available range for editing an attribute in an image. An image editing system computes a metric for an attribute in an input image as a function of a latent space representation of the input image and a filtering vector for editing the input image. The image editing system compares the metric to a threshold. If the metric exceeds the threshold, then the image editing system selects a first range for editing the attribute in the input image. If the metric does not exceed the threshold, a second range is selected. The image editing system causes display of a user interface for editing the input image comprising an interface element for editing the attribute within the selected range.
-
9.
公开(公告)号:US20230342893A1
公开(公告)日:2023-10-26
申请号:US17660090
申请日:2022-04-21
Applicant: Adobe Inc.
Inventor: Tobias Hinz , Shabnam Ghadar , Richard Zhang , Ratheesh Kalarot , Jingwan Lu , Elya Shechtman
CPC classification number: G06T5/50 , G06T11/60 , G06V10/82 , G06T2207/20221 , G06T2207/30201
Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for combining digital images. In particular, in one or more embodiments, the disclosed systems combine latent codes of a source digital image and a target digital image utilizing a blending network to determine a combined latent encoding and generate a combined digital image from the combined latent encoding utilizing a generative neural network. In some embodiments, the disclosed systems determine an intersection face mask between the source digital image and the combined digital image utilizing a face segmentation network and combine the source digital image and the combined digital image utilizing the intersection face mask to generate a blended digital image.
-
公开(公告)号:US20220270310A1
公开(公告)日:2022-08-25
申请号:US17182492
申请日:2021-02-23
Applicant: Adobe Inc.
Inventor: Akhilesh Kumar , Baldo Faieta , Piotr Walczyszyn , Ratheesh Kalarot , Archie Bagnall , Shabnam Ghadar , Wei-An Lin , Cameron Smith , Christian Cantrell , Patrick Hebron , Wilson Chan , Jingwan Lu , Holger Winnemoeller , Sven Olsen
Abstract: The present disclosure describes systems, methods, and non-transitory computer readable media for detecting user interactions to edit a digital image from a client device and modify the digital image for the client device by using a web-based intermediary that modifies a latent vector of the digital image and an image modification neural network to generate a modified digital image from the modified latent vector. In response to user interaction to modify a digital image, for instance, the disclosed systems modify a latent vector extracted from the digital image to reflect the requested modification. The disclosed systems further use a latent vector stream renderer (as an intermediary device) to generate an image delta that indicates a difference between the digital image and the modified digital image. The disclosed systems then provide the image delta as part of a digital stream to a client device to quickly render the modified digital image.
-
-
-
-
-
-
-
-
-