-
公开(公告)号:US20240135613A1
公开(公告)日:2024-04-25
申请号:US18320664
申请日:2023-05-19
Applicant: Adobe Inc.
Inventor: Zhihong Ding , Scott Cohen , Matthew Joss , Jianming Zhang , Darshan Prasad , Celso Gomes , Jonathan Brandt
CPC classification number: G06T11/60 , G06F3/04842 , G06T3/40 , G06T5/005 , G06T7/50 , G06V10/761 , G06T2207/20084
Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that implement perspective-aware object move operations for digital image editing. For instance, in some embodiments, the disclosed systems determine a vanishing point associated with a digital image portraying an object. Additionally, the disclosed systems detect one or more user interactions for moving the object within the digital image. Based on moving the object with respect to the vanishing point, the disclosed systems perform a perspective-based resizing of the object within the digital image.
-
公开(公告)号:US20230360180A1
公开(公告)日:2023-11-09
申请号:US17661985
申请日:2022-05-04
Applicant: Adobe Inc.
Inventor: Haitian Zheng , Zhe Lin , Jingwan Lu , Scott Cohen , Elya Shechtman , Connelly Barnes , Jianming Zhang , Ning Xu , Sohrab Amirghodsi
CPC classification number: G06T5/005 , G06T3/4046 , G06V10/40 , G06T2207/20084
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that generate inpainted digital images utilizing a cascaded modulation inpainting neural network. For example, the disclosed systems utilize a cascaded modulation inpainting neural network that includes cascaded modulation decoder layers. For example, in one or more decoder layers, the disclosed systems start with global code modulation that captures the global-range image structures followed by an additional modulation that refines the global predictions. Accordingly, in one or more implementations, the image inpainting system provides a mechanism to correct distorted local details. Furthermore, in one or more implementations, the image inpainting system leverages fast Fourier convolutions block within different resolution layers of the encoder architecture to expand the receptive field of the encoder and to allow the network encoder to better capture global structure.
-
53.
公开(公告)号:US20230325996A1
公开(公告)日:2023-10-12
申请号:US18167690
申请日:2023-02-10
Applicant: Adobe Inc.
Inventor: Zhifei Zhang , Jianming Zhang , Scott Cohen , Zhe Lin
IPC: G06T5/50 , G06T3/40 , G06V10/60 , G06F3/04842
CPC classification number: G06T5/50 , G06T3/40 , G06V10/60 , G06F3/04842 , G06T2207/20101 , G06T2207/20104 , G06T2207/20221
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that generates composite images via auto-compositing features. For example, in one or more embodiments, the disclosed systems determine a background image and a foreground object image for use in generating a composite image. The disclosed systems further provide, for display within a graphical user interface of a client device, at least one selectable option for executing an auto-composite model for the composite image, the auto-composite model comprising at least one of a scale prediction model, a harmonization model, or a shadow generation model. The disclosed systems detect, via the graphical user interface, a user selection of the at least one selectable option and generate, in response to detecting the user selection, the composite image by executing the auto-composite model using the background image and the foreground object image.
-
54.
公开(公告)号:US20230325991A1
公开(公告)日:2023-10-12
申请号:US17658770
申请日:2022-04-11
Applicant: Adobe Inc.
Inventor: Zhe Lin , Sijie Zhu , Jason Wen Yong Kuen , Scott Cohen , Zhifei Zhang
CPC classification number: G06T5/50 , G06T7/194 , G06T5/002 , G06T3/60 , G06T2207/20084 , G06T2207/20221
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media that utilizes artificial intelligence to learn to recommend foreground object images for use in generating composite images based on geometry and/or lighting features. For instance, in one or more embodiments, the disclosed systems transform a foreground object image corresponding to a background image using at least one of a geometry transformation or a lighting transformation. The disclosed systems further generating predicted embeddings for the background image, the foreground object image, and the transformed foreground object image within a geometry-lighting-sensitive embedding space utilizing a geometry-lighting-aware neural network. Using a loss determined from the predicted embeddings, the disclosed systems update parameters of the geometry-lighting-aware neural network. The disclosed systems further provide a variety of efficient user interfaces for generating composite digital images.
-
55.
公开(公告)号:US20230245266A1
公开(公告)日:2023-08-03
申请号:US18298630
申请日:2023-04-11
Applicant: Adobe Inc.
Inventor: Haitian Zheng , Zhe Lin , Jingwan Lu , Scott Cohen , Jianming Zhang , Ning Su
CPC classification number: G06T3/0093 , G06T9/002 , G06T11/00 , G06V10/46 , G06V30/2504 , G06F18/213 , G06T2210/36
Abstract: This disclosure describes one or more implementations of a digital image semantic layout manipulation system that generates refined digital images resembling the style of one or more input images while following the structure of an edited semantic layout. For example, in various implementations, the digital image semantic layout manipulation system builds and utilizes a sparse attention warped image neural network to generate high-resolution warped images and a digital image layout neural network to enhance and refine the high-resolution warped digital image into a realistic and accurate refined digital image.
-
公开(公告)号:US20230136913A1
公开(公告)日:2023-05-04
申请号:US18147278
申请日:2022-12-28
Applicant: Adobe Inc.
Inventor: Yinan Zhao , Brian Price , Scott Cohen
Abstract: The present disclosure relates to a class-agnostic object segmentation system that automatically detects, segments, and selects objects within digital images irrespective of object semantic classifications. For example, the object segmentation system utilizes a class-agnostic object segmentation neural network to segment each pixel in a digital image into an object mask. Further, in response to detecting a selection request of a target object, the object segmentation system utilizes a corresponding object mask to automatically select the target object within the digital image. In some implementations, the object segmentation system utilizes a class-agnostic object segmentation neural network to detect and automatically select a partial object in the digital image in response to a target object selection request.
-
公开(公告)号:US11636570B2
公开(公告)日:2023-04-25
申请号:US17220543
申请日:2021-04-01
Applicant: Adobe Inc.
Inventor: Haitian Zheng , Zhe Lin , Jingwan Lu , Scott Cohen , Jianming Zhang , Ning Xu
Abstract: This disclosure describes one or more implementations of a digital image semantic layout manipulation system that generates refined digital images resembling the style of one or more input images while following the structure of an edited semantic layout. For example, in various implementations, the digital image semantic layout manipulation system builds and utilizes a sparse attention warped image neural network to generate high-resolution warped images and a digital image layout neural network to enhance and refine the high-resolution warped digital image into a realistic and accurate refined digital image.
-
公开(公告)号:US11636270B2
公开(公告)日:2023-04-25
申请号:US16775697
申请日:2020-01-29
Applicant: ADOBE INC.
Inventor: Zhe Lin , Walter W. Chang , Scott Cohen , Khoi Viet Pham , Jonathan Brandt , Franck Dernoncourt
IPC: G06F40/30 , G06F16/532 , G06F16/55 , G06F40/205 , G06F40/295 , G06N5/02 , G06N5/04 , G06N20/00 , G06V10/40 , G06V30/10
Abstract: Embodiments of the present invention provide systems, methods, and non-transitory computer storage media for parsing a given input referring expression into a parse structure and generating a semantic computation graph to identify semantic relationships among and between objects. At a high level, when embodiments of the preset invention receive a referring expression, a parse tree is created and mapped into a hierarchical subject, predicate, object graph structure that labeled noun objects in the referring expression, the attributes of the labeled noun objects, and predicate relationships (e.g., verb actions or spatial propositions) between the labeled objects. Embodiments of the present invention then transform the subject, predicate, object graph structure into a semantic computation graph that may be recursively traversed and interpreted to determine how noun objects, their attributes and modifiers, and interrelationships are provided to downstream image editing, searching, or caption indexing tasks.
-
公开(公告)号:US20220383037A1
公开(公告)日:2022-12-01
申请号:US17332734
申请日:2021-05-27
Applicant: Adobe Inc.
Inventor: Khoi Pham , Kushal Kafle , Zhe Lin , Zhihong Ding , Scott Cohen , Quan Tran
Abstract: This disclosure describes one or more implementations of systems, non-transitory computer-readable media, and methods that extract multiple attributes from an object portrayed in a digital image utilizing a multi-attribute contrastive classification neural network. For example, the disclosed systems utilize a multi-attribute contrastive classification neural network that includes an embedding neural network, a localizer neural network, a multi-attention neural network, and a classifier neural network. In some cases, the disclosed systems train the multi-attribute contrastive classification neural network utilizing a multi-attribute, supervised-contrastive loss. In some embodiments, the disclosed systems generate negative attribute training labels for labeled digital images utilizing positive attribute labels that correspond to the labeled digital images.
-
公开(公告)号:US20220327657A1
公开(公告)日:2022-10-13
申请号:US17220543
申请日:2021-04-01
Applicant: Adobe Inc.
Inventor: Haitian Zheng , Zhe Lin , Jingwan Lu , Scott Cohen , Jianming Zhang , Ning Xu
Abstract: This disclosure describes one or more implementations of a digital image semantic layout manipulation system that generates refined digital images resembling the style of one or more input images while following the structure of an edited semantic layout. For example, in various implementations, the digital image semantic layout manipulation system builds and utilizes a sparse attention warped image neural network to generate high-resolution warped images and a digital image layout neural network to enhance and refine the high-resolution warped digital image into a realistic and accurate refined digital image.
-
-
-
-
-
-
-
-
-