TRANSFORMER-BASED IMAGE SEGMENTATION ON MOBILE DEVICES

    公开(公告)号:US20240281978A1

    公开(公告)日:2024-08-22

    申请号:US18170336

    申请日:2023-02-16

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for generating segmentation masks for a digital visual media item. In particular, in one or more embodiments, the disclosed systems generate, utilizing a neural network encoder, high-level features of a digital visual media item. Further, the disclosed systems generate, utilizing the neural network encoder, low-level features of the digital visual media item. In some implementations, the disclosed systems generate, utilizing a neural network decoder, an initial segmentation mask of the digital visual media item from the low-level features. Moreover, the disclosed systems generate, utilizing the neural network decoder, a refined segmentation mask of the digital visual media item from the initial segmentation mask and the high-level features.

    SEMANTIC IMAGE SYNTHESIS
    7.
    发明申请

    公开(公告)号:US20250086849A1

    公开(公告)日:2025-03-13

    申请号:US18463333

    申请日:2023-09-08

    Applicant: ADOBE INC.

    Abstract: Embodiments of the present disclosure include obtaining a text prompt describing an element, layout information indicating a target region for the element, and a precision level corresponding to the element. Some embodiments generate a text feature pyramid based on the text prompt, the layout information, and the precision level, wherein the text feature pyramid comprises a plurality of text feature maps at a plurality of scales, respectively. Then, an image is generated based on the text feature pyramid. In some cases, the image includes an object corresponding to the element of the text prompt at the target region. Additionally, a shape of the object corresponds to a shape of the target region based on the precision level.

    PANOPTICALLY GUIDED INPAINTING UTILIZING A PANOPTIC INPAINTING NEURAL NETWORK

    公开(公告)号:US20240127410A1

    公开(公告)日:2024-04-18

    申请号:US17937695

    申请日:2022-10-03

    Applicant: Adobe Inc.

    CPC classification number: G06T5/005 G06T7/11 G06T2207/20084

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for panoptically guiding digital image inpainting utilizing a panoptic inpainting neural network. In some embodiments, the disclosed systems utilize a panoptic inpainting neural network to generate an inpainted digital image according to panoptic segmentation map that defines pixel regions corresponding to different panoptic labels. In some cases, the disclosed systems train a neural network utilizing a semantic discriminator that facilitates generation of digital images that are realistic while also conforming to a semantic segmentation. The disclosed systems generate and provide a panoptic inpainting interface to facilitate user interaction for inpainting digital images. In certain embodiments, the disclosed systems iteratively update an inpainted digital image based on changes to a panoptic segmentation map.

Patent Agency Ranking