Detecting digital objects and generating object masks on device

    公开(公告)号:US12272127B2

    公开(公告)日:2025-04-08

    申请号:US17589114

    申请日:2022-01-31

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that generates object masks for digital objects portrayed in digital images utilizing a detection-masking neural network pipeline. In particular, in one or more embodiments, the disclosed systems utilize detection heads of a neural network to detect digital objects portrayed within a digital image. In some cases, each detection head is associated with one or more digital object classes that are not associated with the other detection heads. Further, in some cases, the detection heads implement multi-scale synchronized batch normalization to normalize feature maps across various feature levels. The disclosed systems further utilize a masking head of the neural network to generate one or more object masks for the detected digital objects. In some cases, the disclosed systems utilize post-processing techniques to filter out low-quality masks.

    SEMANTIC IMAGE SYNTHESIS
    93.
    发明申请

    公开(公告)号:US20250086849A1

    公开(公告)日:2025-03-13

    申请号:US18463333

    申请日:2023-09-08

    Applicant: ADOBE INC.

    Abstract: Embodiments of the present disclosure include obtaining a text prompt describing an element, layout information indicating a target region for the element, and a precision level corresponding to the element. Some embodiments generate a text feature pyramid based on the text prompt, the layout information, and the precision level, wherein the text feature pyramid comprises a plurality of text feature maps at a plurality of scales, respectively. Then, an image is generated based on the text feature pyramid. In some cases, the image includes an object corresponding to the element of the text prompt at the target region. Additionally, a shape of the object corresponds to a shape of the target region based on the precision level.

    Generating shadows for digital objects within digital images utilizing a height map

    公开(公告)号:US12169895B2

    公开(公告)日:2024-12-17

    申请号:US17502782

    申请日:2021-10-15

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that generate a height map for a digital object portrayed in a digital image and further utilizes the height map to generate a shadow for the digital object. Indeed, in one or more embodiments, the disclosed systems generate (e.g., utilizing a neural network) a height map that indicates the pixels heights for pixels of a digital object portrayed in a digital image. The disclosed systems utilize the pixel heights, along with lighting information for the digital image, to determine how the pixels of the digital image project to create a shadow for the digital object. Further, in some implementations, the disclosed systems utilize the determined shadow projections to generate (e.g., utilizing another neural network) a soft shadow for the digital object. Accordingly, in some cases, the disclosed systems modify the digital image to include the shadow.

    Dynamic log depth compression estimation system

    公开(公告)号:US12125227B2

    公开(公告)日:2024-10-22

    申请号:US17656605

    申请日:2022-03-25

    Applicant: Adobe Inc.

    Inventor: Jianming Zhang

    CPC classification number: G06T7/593 G06F17/11 G06N20/00

    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for training and/or implementing machine learning models utilizing compressed log scene measurement maps. For example, the disclosed system generates compressed log scene measurement maps by converting scene measurement maps to compressed log scene measurement maps by applying a logarithmic function. In particular, the disclosed system uses scene measurement distribution metrics from a digital image to determine a base for the logarithmic function. In this way, the compressed log scene measurement maps normalize ranges within a digital image and accurately differentiates between scene elements objects at a variety of depths. Moreover, for training, the disclosed system generates a predicted scene measurement map via a machine learning model and compares the predicted scene measurement map with a compressed log ground truth map. By doing so, the disclosed system trains the machine learning model to generate accurate compressed log depth maps.

    Generating image masks from digital images via color density estimation and deep learning models

    公开(公告)号:US12008734B2

    公开(公告)日:2024-06-11

    申请号:US17823364

    申请日:2022-08-30

    Applicant: Adobe Inc.

    Inventor: Jianming Zhang

    Abstract: This disclosure describes methods, non-transitory computer readable storage media, and systems that utilize color density estimation in a blended boundary region of a digital image to generate an image mask. For example, the disclosed system extracts a foreground region, a background region, and a blended boundary region from a digital image. The disclosed system determines a color histogram—within a color space selected utilizing the foreground region and the background region—for a portion of the background region along an edge of the blended boundary region. Additionally, the disclosed system generates a color density map for the blended boundary region by comparing colors in the blended boundary region to colors in the color histogram of the background band. The disclosed system then generates a final mask for the digital image based on the color density map.

Patent Agency Ranking