Generating improved alpha mattes for digital images based on pixel classification probabilities across alpha-range classifications

    公开(公告)号:US12236610B2

    公开(公告)日:2025-02-25

    申请号:US17500736

    申请日:2021-10-13

    Applicant: Adobe Inc.

    Abstract: This disclosure describes one or more implementations of an alpha matting system that utilizes a deep learning model to generate alpha mattes for digital images utilizing an alpha-range classifier function. More specifically, in various implementations, the alpha matting system builds and utilizes an object mask neural network having a decoder that includes an alpha-range classifier to determine classification probabilities for pixels of a digital image with respect to multiple alpha-range classifications. In addition, the alpha matting system can utilize a refinement model to generate the alpha matte from the pixel classification probabilities with respect to the multiple alpha-range classifications.

    AUTOMATIC REMOVAL OF LIGHTING EFFECTS FROM AN IMAGE

    公开(公告)号:US20240404138A1

    公开(公告)日:2024-12-05

    申请号:US18205322

    申请日:2023-06-02

    Applicant: Adobe Inc.

    Abstract: In accordance with the described techniques, an image delighting system receives an input image depicting a human subject that includes lighting effects. The image delighting system further generates a segmentation mask and a skin tone mask. The segmentation mask includes multiple segments each representing a different portion of the human subject, and the skin tone mask identifies one or more color values for a skin region of the human subject. Using a machine learning lighting removal network, the image delighting system generates an unlit image by removing the lighting effects from the input image based on the segmentation mask and the skin tone mask.

    GENERATING ALPHA MATTES FOR DIGITAL IMAGES UTILIZING A TRANSFORMER-BASED ENCODER-DECODER

    公开(公告)号:US20240362825A1

    公开(公告)日:2024-10-31

    申请号:US18762395

    申请日:2024-07-02

    Applicant: Adobe Inc.

    CPC classification number: G06T9/002 G06N3/04 G06T3/4046 G06T7/11 G06T7/194

    Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for utilizing a transformer-based encoder-decoder neural network architecture for generating alpha mattes for digital images. Specifically, the disclosed system utilizes a transformer encoder to generate patch-based encodings from a digital image and a trimap segmentation by generating patch encodings for image patches and comparing the patch encodings to areas of the digital image. Additionally, the disclosed system generates modified patch-based encodings utilizing a plurality of neural network layers. The disclosed system also generates an alpha matte for the digital image from the patch-based encodings utilizing a decoder that includes a plurality of upsampling layers connected to a plurality of neural network layers via skip connections. In additional embodiments, the disclosed system generates the alpha matte based on additional encodings generated by a plurality of convolutional neural network layers connected to a subset of the upsampling layers via skip connections.

    UTILIZING A TWO-STREAM ENCODER NEURAL NETWORK TO GENERATE COMPOSITE DIGITAL IMAGES

    公开(公告)号:US20220012885A1

    公开(公告)日:2022-01-13

    申请号:US17483280

    申请日:2021-09-23

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to utilizing a neural network having a two-stream encoder architecture to accurately generate composite digital images that realistically portray a foreground object from one digital image against a scene from another digital image. For example, the disclosed systems can utilize a foreground encoder of the neural network to identify features from a foreground image and further utilize a background encoder to identify features from a background image. The disclosed systems can then utilize a decoder to fuse the features together and generate a composite digital image. The disclosed systems can train the neural network utilizing an easy-to-hard data augmentation scheme implemented via self-teaching. The disclosed systems can further incorporate the neural network within an end-to-end framework for automation of the image composition process.

    UTILIZING A NEURAL NETWORK HAVING A TWO-STREAM ENCODER ARCHITECTURE TO GENERATE COMPOSITE DIGITAL IMAGES

    公开(公告)号:US20210027470A1

    公开(公告)日:2021-01-28

    申请号:US16523465

    申请日:2019-07-26

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to utilizing a neural network having a two-stream encoder architecture to accurately generate composite digital images that realistically portray a foreground object from one digital image against a scene from another digital image. For example, the disclosed systems can utilize a foreground encoder of the neural network to identify features from a foreground image and further utilize a background encoder to identify features from a background image. The disclosed systems can then utilize a decoder to fuse the features together and generate a composite digital image. The disclosed systems can train the neural network utilizing an easy-to-hard data augmentation scheme implemented via self-teaching. The disclosed systems can further incorporate the neural network within an end-to-end framework for automation of the image composition process.

    MARKING-BASED PORTRAIT RELIGHTING
    18.
    发明申请

    公开(公告)号:US20240404188A1

    公开(公告)日:2024-12-05

    申请号:US18205279

    申请日:2023-06-02

    Applicant: Adobe Inc.

    Abstract: In accordance with the described techniques, a portrait relighting system receives user input defining one or more markings drawn on a portrait image. Using one or more machine learning models, the portrait relighting system generates an albedo representation of the portrait image by removing lighting effects from the portrait image. Further, the portrait relighting system generates a shading map of the portrait image using the one or more machine learning models by designating the one or more markings as a lighting condition, and applying the lighting condition to a geometric representation of the portrait image. The one or more machine learning models are further employed to generate a relit portrait image based on the albedo representation and the shading map.

    Generating alpha mattes for digital images utilizing a transformer-based encoder-decoder

    公开(公告)号:US12051225B2

    公开(公告)日:2024-07-30

    申请号:US17513559

    申请日:2021-10-28

    Applicant: Adobe Inc.

    CPC classification number: G06T9/002 G06N3/04 G06T3/4046 G06T7/11 G06T7/194

    Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for utilizing a transformer-based encoder-decoder neural network architecture for generating alpha mattes for digital images. Specifically, the disclosed system utilizes a transformer encoder to generate patch-based encodings from a digital image and a trimap segmentation by generating patch encodings for image patches and comparing the patch encodings to areas of the digital image. Additionally, the disclosed system generates modified patch-based encodings utilizing a plurality of neural network layers. The disclosed system also generates an alpha matte for the digital image from the patch-based encodings utilizing a decoder that includes a plurality of upsampling layers connected to a plurality of neural network layers via skip connections. In additional embodiments, the disclosed system generates the alpha matte based on additional encodings generated by a plurality of convolutional neural network layers connected to a subset of the upsampling layers via skip connections.

Patent Agency Ranking