-
公开(公告)号:US12105767B2
公开(公告)日:2024-10-01
申请号:US17735748
申请日:2022-05-03
Applicant: Adobe Inc.
Inventor: Zhaowen Wang , Yue Bai , John Philip Collomosse
IPC: G06F16/9537 , G06F40/103 , G06F40/30 , G06V30/19 , G06K15/02 , G06N3/08 , G06N20/00 , G06V10/82 , G06V30/412 , G06V30/414
CPC classification number: G06F16/9537 , G06F40/103 , G06F40/30 , G06V30/19127 , G06K15/1885 , G06N3/08 , G06N20/00 , G06V10/82 , G06V30/412 , G06V30/414
Abstract: Digital content layout encoding techniques for search are described. In these techniques, a layout representation is generated (using machine learning automatically and without user intervention) that describes a layout of elements included within the digital content. In an implementation, the layout representation includes a description of both spatial and structural aspects of the elements in relation to each other. To do so, a two-pathway pipeline that is configured to model layout from both spatial and structural aspects using a spatial pathway, and a structural pathway, respectively. In one example, this is also performed through use of multi-level encoding and fusion to generate a layout representation.
-
公开(公告)号:US20220215205A1
公开(公告)日:2022-07-07
申请号:US17142030
申请日:2021-01-05
Applicant: ADOBE INC.
Inventor: Viswanathan Swaminathan , John Philip Collomosse , Eric Nguyen
IPC: G06K9/62 , G06K9/46 , G06K9/00 , G06T7/60 , G06F16/532 , G06F16/583 , G06N20/00
Abstract: A visual search system facilitates retrieval of provenance information using a machine learning model to generate content fingerprints that are invariant to benign transformations while being sensitive to manipulations. The machine learning model is trained on a training image dataset that includes original images, benign transformed variants of the original images, and manipulated variants of the original images. A loss function is used to train the machine learning model to minimize distances in an embedding space between benign transformed variants and their corresponding original images and increase distances between the manipulated variants and their corresponding original images.
-
公开(公告)号:US12147495B2
公开(公告)日:2024-11-19
申请号:US17142030
申请日:2021-01-05
Applicant: ADOBE INC.
Inventor: Viswanathan Swaminathan , John Philip Collomosse , Eric Nguyen
IPC: G06F18/214 , G06F16/532 , G06F16/583 , G06F18/213 , G06N20/00 , G06T7/60 , G06V10/40 , G06V20/00
Abstract: A visual search system facilitates retrieval of provenance information using a machine learning model to generate content fingerprints that are invariant to benign transformations while being sensitive to manipulations. The machine learning model is trained on a training image dataset that includes original images, benign transformed variants of the original images, and manipulated variants of the original images. A loss function is used to train the machine learning model to minimize distances in an embedding space between benign transformed variants and their corresponding original images and increase distances between the manipulated variants and their corresponding original images.
-
公开(公告)号:US10268928B2
公开(公告)日:2019-04-23
申请号:US15616776
申请日:2017-06-07
Applicant: Adobe Inc.
Inventor: Hailin Jin , John Philip Collomosse
Abstract: A combined structure and style network is described. Initially, a large set of training images, having a variety of different styles, is obtained. Each of these training images is associated with one of multiple different predetermined style categories indicating the image's style and one of multiple different predetermined semantic categories indicating objects depicted in the image. Groups of these images are formed, such that each group includes an anchor image having one of the styles, a positive-style example image having the same style as the anchor image, and a negative-style example image having a different style. Based on those groups, an image style network is generated to identify images having desired styling by recognizing visual characteristics of the different styles. The image style network is further combined, according to a unifying training technique, with an image structure network configured to recognize desired objects in images irrespective of image style.
-
公开(公告)号:US20240419750A1
公开(公告)日:2024-12-19
申请号:US18822367
申请日:2024-09-02
Applicant: Adobe Inc.
Inventor: Zhaowen Wang , Yue Bai , John Philip Collomosse
IPC: G06F16/9537 , G06F40/103 , G06F40/30 , G06K15/02 , G06N3/08 , G06N20/00 , G06V10/82 , G06V30/19 , G06V30/412 , G06V30/414
Abstract: Digital content layout encoding techniques for search are described. In these techniques, a layout representation is generated (using machine learning automatically and without user intervention) that describes a layout of elements included within the digital content. In an implementation, the layout representation includes a description of both spatial and structural aspects of the elements in relation to each other. To do so, a two-pathway pipeline that is configured to model layout from both spatial and structural aspects using a spatial pathway, and a structural pathway, respectively. In one example, this is also performed through use of multi-level encoding and fusion to generate a layout representation.
-
公开(公告)号:US20240169623A1
公开(公告)日:2024-05-23
申请号:US18057857
申请日:2022-11-22
Applicant: ADOBE INC.
Inventor: Yu Zeng , Zhe Lin , Jianming Zhang , Qing Liu , Jason Wen Yong Kuen , John Philip Collomosse
IPC: G06T11/60 , G06F40/295 , G06T7/11 , G06V10/774 , G06V10/776
CPC classification number: G06T11/60 , G06F40/295 , G06T7/11 , G06V10/774 , G06V10/776 , G06T2200/24 , G06T2207/20081 , G06T2207/20084
Abstract: Systems and methods for multi-modal image generation are provided. One or more aspects of the systems and methods includes obtaining a text prompt and layout information indicating a target location for an element of the text prompt within an image to be generated and computing a text feature map including a plurality of values corresponding to the element of the text prompt at pixel locations corresponding to the target location. Then the image is generated based on the text feature map using a diffusion model. The generated image includes the element of the text prompt at the target location.
-
公开(公告)号:US20230359682A1
公开(公告)日:2023-11-09
申请号:US17735748
申请日:2022-05-03
Applicant: Adobe Inc.
Inventor: Zhaowen Wang , Yue Bai , John Philip Collomosse
IPC: G06F16/9537 , G06F40/30
CPC classification number: G06F16/9537 , G06F40/30 , G06N20/00
Abstract: Digital content layout encoding techniques for search are described. In these techniques, a layout representation is generated (using machine learning automatically and without user intervention) that describes a layout of elements included within the digital content. In an implementation, the layout representation includes a description of both spatial and structural aspects of the elements in relation to each other. To do so, a two-pathway pipeline that is configured to model layout from both spatial and structural aspects using a spatial pathway, and a structural pathway, respectively. In one example, this is also performed through use of multi-level encoding and fusion to generate a layout representation.
-
公开(公告)号:US10699453B2
公开(公告)日:2020-06-30
申请号:US15679602
申请日:2017-08-17
Applicant: Adobe Inc.
Inventor: Hailin Jin , John Philip Collomosse , Brian L. Price
Abstract: Techniques and systems are described for style-aware patching of a digital image in a digital medium environment. For example, a digital image creation system generates style data for a portion to be filled of a digital image, indicating a style of an area surrounding the portion. The digital image creation system also generates content data for the portion indicating content of the digital image of the area surrounding the portion. The digital image creation system selects a source digital image based on similarity of both style and content of the source digital image at a location of the patch to the style data and content data. The digital image creation system transforms the style of the source digital image based on the style data and generates the patch from the source digital image in the transformed style for incorporation into the portion to be filled of the digital image.
-
公开(公告)号:US20250104288A1
公开(公告)日:2025-03-27
申请号:US18471456
申请日:2023-09-21
Applicant: Adobe Inc.
Inventor: Shruti Agarwal , John Philip Collomosse
IPC: G06T9/00
Abstract: Techniques for latent space based steganographic image generation are described. A processing device, for instance, receives a digital image and a secret that includes a bit string. A pretrained encoder of an autoencoder generates an embedding of the digital image that includes latent code. A secret encoder is trained and utilized to generate an embedding of the secret to act as a latent offset to the latent code. The processing device leverages a pretrained decoder of the autoencoder to generate a steganographic image based on the embedding of the secret and the embedding of the digital image. The steganographic image includes the secret and is visually indiscernible from the digital image. Further, the processing device is configured to recover the secret from the steganographic image, such as by training and leveraging a secret decoder to extract the secret.
-
公开(公告)号:US10733228B2
公开(公告)日:2020-08-04
申请号:US16432834
申请日:2019-06-05
Applicant: Adobe Inc.
Inventor: Hailin Jin , John Philip Collomosse
Abstract: Sketch and style based image retrieval in a digital medium environment is described. Initially, a user sketches an object to be searched in connection with an image search. Styled images are selected to indicate a desired style of images to be returned by the search. A search request is generated based on the sketch and selected images. Responsive to the request, an image repository is searched to identify images having the desired object and styling. To search the image repository, a neural network is utilized that is capable of recognizing the desired object in images based on visual characteristics of the sketch and independently recognizing the desired styling in images based on visual characteristics of the selected images. This independent recognition allows desired styling to be specified by selecting images having the style but not the desired object. Images having the desired object and styling are returned.
-
-
-
-
-
-
-
-
-