-
Publication number: US20230298148A1
Publication date: 2023-09-21
Application number: US17655663
Application date: 2022-03-21
Applicant: Adobe Inc.
Inventor: He Zhang , Jianming Zhang , Jose Ignacio Echevarria Vallespi , Kalyan Sunkavalli , Meredith Payne Stotzner , Yinglan Ma , Zhe Lin , Elya Shechtman , Frederick Mandia
CPC classification number: G06T5/50 , G06T7/194 , G06T7/90 , G06T11/001 , G06T2207/20084 , G06T2207/20212 , G06T2200/24 , G06T2207/20092 , G06T2207/20016 , G06T2207/20081 , G06T2207/30168
Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods that implement a dual-branched neural network architecture to harmonize composite images. For example, in one or more implementations, the transformer-based harmonization system uses a convolutional branch and a transformer branch to generate a harmonized composite image based on an input composite image and a corresponding segmentation mask. More particularly, the convolutional branch comprises a series of convolutional neural network layers followed by a style normalization layer to extract localized information from the input composite image. Further, the transformer branch comprises a series of transformer neural network layers to extract global information based on different resolutions of the input composite image. Utilizing a decoder, the transformer-based harmonization system combines the local information and the global information from the corresponding convolutional branch and transformer branch to generate a harmonized composite image.
-
Publication number: US11758082B2
Publication date: 2023-09-12
Application number: US17526853
Application date: 2021-11-15
Applicant: Adobe Inc.
Inventor: Lu Zhang , Jianming Zhang , Zhe Lin , Radomir Mech
IPC: H04N5/262 , G11B27/031 , G06V20/40 , G06V10/20 , G06V40/18
CPC classification number: H04N5/2628 , G06V10/255 , G06V20/40 , G06V20/41 , G11B27/031 , G06V40/193
Abstract: Systems and methods provide reframing operations in a smart editing system that may generate a focal point within a mask of an object for each frame of a video segment and perform editing effects on the frames of the video segment to quickly provide users with natural video editing effects. A reframing engine may process video clips using a segmentation and hotspot module to determine a salient region of an object, generate a mask of the object, and track the trajectory of the object in the video clips. The reframing engine may then receive reframing parameters from a crop suggestion module and a user interface. Based on the determined trajectory of an object in a video clip and reframing parameters, the reframing engine may use reframing logic to produce temporally consistent reframing effects relative to an object for the video clip.
-
Publication number: US20230245266A1
Publication date: 2023-08-03
Application number: US18298630
Application date: 2023-04-11
Applicant: Adobe Inc.
Inventor: Haitian Zheng , Zhe Lin , Jingwan Lu , Scott Cohen , Jianming Zhang , Ning Xu
CPC classification number: G06T3/0093 , G06T9/002 , G06T11/00 , G06V10/46 , G06V30/2504 , G06F18/213 , G06T2210/36
Abstract: This disclosure describes one or more implementations of a digital image semantic layout manipulation system that generates refined digital images resembling the style of one or more input images while following the structure of an edited semantic layout. For example, in various implementations, the digital image semantic layout manipulation system builds and utilizes a sparse attention warped image neural network to generate high-resolution warped images and a digital image layout neural network to enhance and refine the high-resolution warped digital image into a realistic and accurate refined digital image.
-
Publication number: US11657546B2
Publication date: 2023-05-23
Application number: US17664800
Application date: 2022-05-24
Applicant: Adobe Inc.
Inventor: Xin Sun , Ruben Villegas , Manuel Lagunas Arto , Jimei Yang , Jianming Zhang
CPC classification number: G06T11/001 , G06T7/11 , G06T7/194 , G06T7/90 , G06T2207/20084 , G06T2207/30196
Abstract: Introduced here are techniques for relighting an image by automatically segmenting a human object in an image. The segmented image is input to an encoder that transforms it into a feature space. The feature space is concatenated with coefficients of a target illumination for the image and input to an albedo decoder and a light transport detector to predict an albedo map and a light transport matrix, respectively. In addition, the output of the encoder is concatenated with outputs of residual parts of each decoder and fed to a light coefficients block, which predicts coefficients of the illumination for the image. The light transport matrix and predicted illumination coefficients are multiplied to obtain a shading map that can sharpen details of the image. The resulting image is scaled by the albedo map to produce the relight image. The relight image can then be refined to remove noise.
-
Publication number: US11636570B2
Publication date: 2023-04-25
Application number: US17220543
Application date: 2021-04-01
Applicant: Adobe Inc.
Inventor: Haitian Zheng , Zhe Lin , Jingwan Lu , Scott Cohen , Jianming Zhang , Ning Xu
Abstract: This disclosure describes one or more implementations of a digital image semantic layout manipulation system that generates refined digital images resembling the style of one or more input images while following the structure of an edited semantic layout. For example, in various implementations, the digital image semantic layout manipulation system builds and utilizes a sparse attention warped image neural network to generate high-resolution warped images and a digital image layout neural network to enhance and refine the high-resolution warped digital image into a realistic and accurate refined digital image.
-
Publication number: US11605168B2
Publication date: 2023-03-14
Application number: US17215067
Application date: 2021-03-29
Applicant: Adobe Inc.
Inventor: Mingyang Ling , Alex Filipkowski , Zhe Lin , Jianming Zhang , Samarth Gulati
IPC: G06K9/62 , G06T7/11 , G06T7/136 , G06T7/143 , G06T7/174 , G06F18/214 , G06N3/045 , G06V10/25 , G06V10/764 , G06V10/82 , G06V10/26
Abstract: Techniques are disclosed for characterizing and defining the location of a copy space in an image. A methodology implementing the techniques according to an embodiment includes applying a regression convolutional neural network (CNN) to an image. The regression CNN is configured to predict properties of the copy space such as size and type (natural or manufactured). The prediction is conditioned on a determination of the presence of the copy space in the image. The method further includes applying a segmentation CNN to the image. The segmentation CNN is configured to generate one or more pixel-level masks to define the location of copy spaces in the image, whether natural or manufactured, or to define the location of a background region of the image. The segmentation CNN may include a first stage comprising convolutional layers and a second stage comprising pairs of boundary refinement layers and bilinear up-sampling layers.
-
Publication number: US11568544B2
Publication date: 2023-01-31
Application number: US17483280
Application date: 2021-09-23
Applicant: Adobe Inc.
Inventor: Zhe Lin , Jianming Zhang , He Zhang , Federico Perazzi
Abstract: The present disclosure relates to utilizing a neural network having a two-stream encoder architecture to accurately generate composite digital images that realistically portray a foreground object from one digital image against a scene from another digital image. For example, the disclosed systems can utilize a foreground encoder of the neural network to identify features from a foreground image and further utilize a background encoder to identify features from a background image. The disclosed systems can then utilize a decoder to fuse the features together and generate a composite digital image. The disclosed systems can train the neural network utilizing an easy-to-hard data augmentation scheme implemented via self-teaching. The disclosed systems can further incorporate the neural network within an end-to-end framework for automation of the image composition process.
-
Publication number: US20220335671A1
Publication date: 2022-10-20
Application number: US17232890
Application date: 2021-04-16
Applicant: ADOBE INC
Inventor: Alan Erickson , Kalyan Sunkavalli , I-Ming Pao , Guotong Feng , Jianming Zhang , Frederick Mandia
Abstract: Systems and methods for image editing are described. Embodiments of the present disclosure provide an image editing system for performing image object replacement or image region replacement (e.g., an image editing system for replacing an object or region of an image with an object or region from another image). For example, the image editing system may replace a sky portion of an image with a more desirable sky portion from a different replacement image. According to some embodiments described herein, real-time color harmonization based on the visible sky region may be used to produce more natural colorization. In some examples, horizon-aware sky alignment and placement with advanced padding may also be used. For example, the horizons of the original image and the replacement image may be automatically detected and aligned, and color harmonization may be performed based on the aligned images.
-
Publication number: US20220327657A1
Publication date: 2022-10-13
Application number: US17220543
Application date: 2021-04-01
Applicant: Adobe Inc.
Inventor: Haitian Zheng , Zhe Lin , Jingwan Lu , Scott Cohen , Jianming Zhang , Ning Xu
Abstract: This disclosure describes one or more implementations of a digital image semantic layout manipulation system that generates refined digital images resembling the style of one or more input images while following the structure of an edited semantic layout. For example, in various implementations, the digital image semantic layout manipulation system builds and utilizes a sparse attention warped image neural network to generate high-resolution warped images and a digital image layout neural network to enhance and refine the high-resolution warped digital image into a realistic and accurate refined digital image.
-
Publication number: US20220292654A1
Publication date: 2022-09-15
Application number: US17200338
Application date: 2021-03-12
Applicant: Adobe Inc.
Inventor: He Zhang , Yifan Jiang , Yilin Wang , Jianming Zhang , Kalyan Sunkavalli , Sarah Kong , Su Chen , Sohrab Amirghodsi , Zhe Lin
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately, efficiently, and flexibly generating harmonized digital images utilizing a self-supervised image harmonization neural network. In particular, the disclosed systems can implement, and learn parameters for, a self-supervised image harmonization neural network to extract content from one digital image (disentangled from its appearance) and appearance from another digital image (disentangled from its content). For example, the disclosed systems can utilize a dual data augmentation method to generate diverse triplets for parameter learning (including input digital images, reference digital images, and pseudo ground truth digital images), via cropping a digital image with perturbations using three-dimensional color lookup tables (“LUTs”). Additionally, the disclosed systems can utilize the self-supervised image harmonization neural network to generate harmonized digital images that depict content from one digital image having the appearance of another digital image.