-
公开(公告)号:US20230177824A1
公开(公告)日:2023-06-08
申请号:US18161666
申请日:2023-01-30
Applicant: Adobe Inc.
Inventor: Brian Price , Scott Cohen , Mai Long , Jun Hao Liew
IPC: G06V10/82 , G06N3/084 , G06T7/11 , G06V10/26 , G06V10/44 , G06F18/40 , G06N3/045 , G06N5/01 , G06V10/94 , G06V10/20
CPC classification number: G06V10/82 , G06N3/084 , G06T7/11 , G06V10/26 , G06V10/454 , G06F18/40 , G06N3/045 , G06N5/01 , G06V10/945 , G06V10/255 , G06N3/044
Abstract: Systems and methods are disclosed for selecting target objects within digital images utilizing a multi-modal object selection neural network trained to accommodate multiple input modalities. In particular, in one or more embodiments, the disclosed systems and methods generate a trained neural network based on training digital images and training indicators corresponding to various input modalities. Moreover, one or more embodiments of the disclosed systems and methods utilize a trained neural network and iterative user inputs corresponding to different input modalities to select target objects in digital images. Specifically, the disclosed systems and methods can transform user inputs into distance maps that can be utilized in conjunction with color channels and a trained neural network to identify pixels that reflect the target object.
-
公开(公告)号:US11587234B2
公开(公告)日:2023-02-21
申请号:US17151111
申请日:2021-01-15
Applicant: Adobe Inc.
Inventor: Yinan Zhao , Brian Price , Scott Cohen
Abstract: The present disclosure relates to a class-agnostic object segmentation system that automatically detects, segments, and selects objects within digital images irrespective of object semantic classifications. For example, the object segmentation system utilizes a class-agnostic object segmentation neural network to segment each pixel in a digital image into an object mask. Further, in response to detecting a selection request of a target object, the object segmentation system utilizes a corresponding object mask to automatically select the target object within the digital image. In some implementations, the object segmentation system utilizes a class-agnostic object segmentation neural network to detect and automatically select a partial object in the digital image in response to a target object selection request.
-
公开(公告)号:US11468110B2
公开(公告)日:2022-10-11
申请号:US16800415
申请日:2020-02-25
Applicant: Adobe Inc.
Inventor: Walter Wei Tuh Chang , Khoi Pham , Scott Cohen , Zhe Lin , Zhihong Ding
IPC: G06F16/532 , G06F16/583 , G06F16/538 , G06F16/33 , G06T11/60 , G06K9/62 , G06F40/279 , G06F40/247 , G06N20/00 , G06F16/242 , G06F16/28 , G06F40/30
Abstract: The present disclosure relates to an object selection system that automatically detects and selects objects in a digital image based on natural language-based inputs. For instance, the object selection system can utilize natural language processing tools to detect objects and their corresponding relationships within natural language object selection queries. For example, the object selection system can determine alternative object terms for unrecognized objects in a natural language object selection query. As another example, the object selection system can determine multiple types of relationships between objects in a natural language object selection query and utilize different object relationship models to select the requested query object.
-
公开(公告)号:US11379987B2
公开(公告)日:2022-07-05
申请号:US17020023
申请日:2020-09-14
Applicant: ADOBE INC.
Inventor: Ning Xu , Brian Price , Scott Cohen
IPC: G06T7/11 , G06T7/194 , G06T7/73 , G11B27/031
Abstract: A temporal object segmentation system determines a location of an object depicted in a video. In some cases, the temporal object segmentation system determines the object's location in a particular frame of the video based on information indicating a previous location of the object in a previous video frame. For example, an encoder neural network in the temporal object segmentation system extracts features describing image attributes of a video frame. A convolutional long-short term memory neural network determines the location of the object in the frame, based on the extracted image attributes and information indicating a previous location in a previous frame. A decoder neural network generates an image mask indicating the object's location in the frame. In some cases, a video editing system receives multiple generated masks for a video, and modifies one or more video frames based on the locations indicated by the masks.
-
公开(公告)号:US11044450B2
公开(公告)日:2021-06-22
申请号:US16435371
申请日:2019-06-07
Applicant: Adobe Inc. , York University
Inventor: Mahmoud Afifi , Michael Brown , Brian Price , Scott Cohen
Abstract: Techniques are described for white balancing an input image by determining a color transformation for the input image based on color transformations that have been computed for training images whose color characteristics are similar to those of the input image. Techniques are also described for generating a training dataset comprising color information for a plurality of training images and color transformation information for the plurality of training images. The color information in the training dataset is searched to identify a subset of training images that are most similar in color to the input image. The color transformation for the input image is then computed by combining color transformation information for the identified training images. The contribution of the color transformation information for any given training image to the combination can be weighted based on the degree of color similarity between the input image and the training image.
-
公开(公告)号:US11004208B2
公开(公告)日:2021-05-11
申请号:US16365213
申请日:2019-03-26
Applicant: Adobe Inc.
Inventor: Brian Lynn Price , Scott Cohen , Marco Forte , Ning Xu
Abstract: Techniques are disclosed for deep neural network (DNN) based interactive image matting. A methodology implementing the techniques according to an embodiment includes generating, by the DNN, an alpha matte associated with an image, based on user-specified foreground region locations in the image. The method further includes applying a first DNN subnetwork to the image, the first subnetwork trained to generate a binary mask based on the user input, the binary mask designating pixels of the image as background or foreground. The method further includes applying a second DNN subnetwork to the generated binary mask, the second subnetwork trained to generate a trimap based on the user input, the trimap designating pixels of the image as background, foreground, or uncertain status. The method further includes applying a third DNN subnetwork to the generated trimap, the third subnetwork trained to generate the alpha matte based on the user input.
-
37.
公开(公告)号:US20210027448A1
公开(公告)日:2021-01-28
申请号:US16518850
申请日:2019-07-22
Applicant: Adobe Inc.
Inventor: Scott Cohen , Zhe Lin , Mingyang Ling
Abstract: The present disclosure relates to an object selection system that accurately detects and automatically selects target instances of user-requested objects (e.g., a query object instance) in a digital image. In one or more embodiments, the object selection system can analyze one or more user inputs to determine an optimal object attribute detection model from multiple specialized and generalized object attribute models. Additionally, the object selection system can utilize the selected object attribute model to detect and select one or more target instances of a query object in an image, where the image includes multiple instances of the query object.
-
公开(公告)号:US20200349464A1
公开(公告)日:2020-11-05
申请号:US16401548
申请日:2019-05-02
Applicant: Adobe Inc.
Inventor: Zhe Lin , Trung Huu Bui , Scott Cohen , Mingyang Ling , Chenyun Wu
IPC: G06N20/00
Abstract: Techniques and systems are provided for training a machine learning model using different datasets to perform one or more tasks. The machine learning model can include a first sub-module configured to perform a first task and a second sub-module configured to perform a second task. The first sub-module can be selected for training using a first training dataset based on a format of the first training dataset. The first sub-module can then be trained using the first training dataset to perform the first task. The second sub-module can be selected for training using a second training dataset based on a format of the second training dataset. The second sub-module can then be trained using the second training dataset to perform the second task.
-
公开(公告)号:US20200311946A1
公开(公告)日:2020-10-01
申请号:US16365213
申请日:2019-03-26
Applicant: Adobe Inc.
Inventor: Brian Lynn Price , Scott Cohen , Marco Forte , Ning Xu
Abstract: Techniques are disclosed for deep neural network (DNN) based interactive image matting. A methodology implementing the techniques according to an embodiment includes generating, by the DNN, an alpha matte associated with an image, based on user-specified foreground region locations in the image. The method further includes applying a first DNN subnetwork to the image, the first subnetwork trained to generate a binary mask based on the user input, the binary mask designating pixels of the image as background or foreground. The method further includes applying a second DNN subnetwork to the generated binary mask, the second subnetwork trained to generate a trimap based on the user input, the trimap designating pixels of the image as background, foreground, or uncertain status. The method further includes applying a third DNN subnetwork to the generated trimap, the third subnetwork trained to generate the alpha matte based on the user input.
-
公开(公告)号:US10754851B2
公开(公告)日:2020-08-25
申请号:US15852506
申请日:2017-12-22
Applicant: Adobe Inc.
Inventor: Scott Cohen , Kushal Kafle , Brian Price
IPC: G06F16/30 , G06F16/242 , G06T11/20 , G06K9/18 , G06N3/02 , G06F16/26 , G06F16/248 , G06F16/2457 , G06N3/04 , G06N3/08 , G06N5/04
Abstract: Systems and techniques are described that provide for question answering using data visualizations, such as bar graphs. Such data visualizations are often generated from collected data, and provided within image files that illustrate the underlying data and relationships between data elements. The described techniques analyze a query and a related data visualization, and identify one or more spatial regions within the data visualization in which an answer to the query may be found.
-
-
-
-
-
-
-
-
-