EXTRACTING ATTRIBUTES FROM ARBITRARY DIGITAL IMAGES UTILIZING A MULTI-ATTRIBUTE CONTRASTIVE CLASSIFICATION NEURAL NETWORK

    Publication number: US20250022252A1

    Publication date: 2025-01-16

    Application number: US18899571

    Filing date: 2024-09-27

    Applicant: Adobe Inc.

    Abstract: This disclosure describes one or more implementations of systems, non-transitory computer-readable media, and methods that extract multiple attributes from an object portrayed in a digital image utilizing a multi-attribute contrastive classification neural network. For example, the disclosed systems utilize a multi-attribute contrastive classification neural network that includes an embedding neural network, a localizer neural network, a multi-attention neural network, and a classifier neural network. In some cases, the disclosed systems train the multi-attribute contrastive classification neural network utilizing a multi-attribute, supervised-contrastive loss. In some embodiments, the disclosed systems generate negative attribute training labels for labeled digital images utilizing positive attribute labels that correspond to the labeled digital images.

    Interpretable label-attentive encoder-decoder parser

    Publication number: US11544456B2

    Publication date: 2023-01-03

    Application number: US16810345

    Filing date: 2020-03-05

    Applicant: ADOBE INC.

    Abstract: Systems and methods for parsing natural language sentences using an artificial neural network (ANN) are described. Embodiments of the described systems and methods may generate a plurality of word representation matrices for an input sentence, wherein each of the word representation matrices is based on an input matrix of word vectors, a query vector, a matrix of key vectors, and a matrix of value vectors, and wherein a number of the word representation matrices is based on a number of syntactic categories, compress each of the plurality of word representation matrices to produce a plurality of compressed word representation matrices, concatenate the plurality of compressed word representation matrices to produce an output matrix of word vectors, and identify at least one word from the input sentence corresponding to a syntactic category based on the output matrix of word vectors.
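The steps named in this abstract (one word representation matrix per syntactic category, compression, concatenation) can be illustrated with a minimal sketch. It is not the patented implementation: the scaled dot-product softmax, the additive combination of each word vector with the attended context, the shared projection matrix `P`, and every function name below are assumptions made for illustration only.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def word_representation(X, q, K, V):
    # One category: a single query vector q attends over all key vectors,
    # giving one weight per word (assumed scaled dot-product attention).
    weights = softmax([dot(q, k) / math.sqrt(len(q)) for k in K])
    # Weighted sum of the value vectors gives a context vector.
    context = [sum(w * v[j] for w, v in zip(weights, V)) for j in range(len(V[0]))]
    # Assumption: each word vector is combined with the context by addition.
    matrix = [[x_j + c_j for x_j, c_j in zip(x, context)] for x in X]
    return matrix, weights

def compress(M, P):
    # Project an n x d matrix to n x k using a d x k matrix P (assumed linear).
    return [[dot(row, col) for col in zip(*P)] for row in M]

def label_attentive_encode(X, queries, K, V, P):
    compressed, weights_per_label = [], []
    for q in queries:  # one query vector per syntactic category
        M, w = word_representation(X, q, K, V)
        compressed.append(compress(M, P))
        weights_per_label.append(w)
    # Concatenate the compressed rows of every category, word by word.
    output = [sum((C[i] for C in compressed), []) for i in range(len(X))]
    return output, weights_per_label

X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # three toy word vectors
queries = [[1.0, 0.0], [0.0, 1.0]]         # two toy categories
P = [[1.0], [1.0]]                         # compress 2 dims down to 1
output, weights = label_attentive_encode(X, queries, X, X, P)
```

Because each category keeps its own attention weights, `weights[c]` can be inspected to see which words category `c` attends to, which is one plausible reading of "interpretable" in the title.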

    Generating dialogue responses utilizing an independent context-dependent additive recurrent neural network

    Publication number: US11120801B2

    Publication date: 2021-09-14

    Application number: US17086805

    Filing date: 2020-11-02

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating dialogue responses based on received utterances utilizing an independent gate context-dependent additive recurrent neural network. For example, the disclosed systems can utilize a neural network model to generate a dialogue history vector based on received utterances and can use the dialogue history vector to generate a dialogue response. The independent gate context-dependent additive recurrent neural network can remove local context to reduce computation complexity and allow for gates at all time steps to be computed in parallel. The independent gate context-dependent additive recurrent neural network maintains the sequential nature of a recurrent neural network using the hidden vector output.

    Utilizing a gated self-attention memory network model for predicting a candidate answer match to a query

    Publication number: US11113479B2

    Publication date: 2021-09-07

    Application number: US16569513

    Filing date: 2019-09-12

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that can determine an answer to a query based on matching probabilities for combinations of respective candidate answers. For example, the disclosed systems can utilize a gated self-attention mechanism (GSAM) to interpret inputs that include contextual information, a query, and candidate answers. The disclosed systems can also utilize a memory network in tandem with the GSAM to form a gated self-attention memory network (GSAMN) to refine outputs or predictions over multiple reasoning hops. Further, the disclosed systems can utilize transfer learning of the GSAM/GSAMN from an initial training dataset to a target training dataset.

    UTILIZING A GATED SELF-ATTENTION MEMORY NETWORK MODEL FOR PREDICTING A CANDIDATE ANSWER MATCH TO A QUERY

    Publication number: US20210081503A1

    Publication date: 2021-03-18

    Application number: US16569513

    Filing date: 2019-09-12

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that can determine an answer to a query based on matching probabilities for combinations of respective candidate answers. For example, the disclosed systems can utilize a gated self-attention mechanism (GSAM) to interpret inputs that include contextual information, a query, and candidate answers. The disclosed systems can also utilize a memory network in tandem with the GSAM to form a gated self-attention memory network (GSAMN) to refine outputs or predictions over multiple reasoning hops. Further, the disclosed systems can utilize transfer learning of the GSAM/GSAMN from an initial training dataset to a target training dataset.

    Methods and Systems for Determining Characteristics of A Dialog Between A Computer and A User

    Publication number: US20230197081A1

    Publication date: 2023-06-22

    Application number: US18107620

    Filing date: 2023-02-09

    Applicant: Adobe Inc.

    CPC classification number: G10L15/22 G10L15/02 G10L15/183

    Abstract: A computer-implemented method is disclosed for determining one or more characteristics of a dialog between a computer system and a user. The method may comprise receiving a system utterance comprising one or more tokens defining one or more words generated by the computer system; receiving a user utterance comprising one or more tokens defining one or more words uttered by a user in response to the system utterance, the system utterance and the user utterance forming a dialog context; receiving one or more utterance candidates comprising one or more tokens; for each utterance candidate, generating an input sequence combining the one or more tokens of each of the system utterance, the user utterance, and the utterance candidate; and for each utterance candidate, evaluating the generated input sequence with a model to determine a probability that the utterance candidate is relevant to the dialog context.
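The per-candidate procedure in this abstract (combine the tokens of the system utterance, the user utterance, and a candidate into one input sequence, then score that sequence with a model) can be sketched as follows. The `[SEP]` separator token, the function names, and the toy overlap scorer are all hypothetical; the abstract does not say how tokens are joined or what model is used, and a real system would presumably use a trained model in place of `toy_overlap_model`.

```python
from typing import Callable, List, Sequence

SEP = "[SEP]"  # hypothetical separator token; not specified in the abstract

def build_input_sequence(system_utt: Sequence[str],
                         user_utt: Sequence[str],
                         candidate: Sequence[str]) -> List[str]:
    # Combine the tokens of the dialog context and one candidate.
    return [*system_utt, SEP, *user_utt, SEP, *candidate]

def score_candidates(system_utt: Sequence[str],
                     user_utt: Sequence[str],
                     candidates: Sequence[Sequence[str]],
                     model: Callable[[List[str]], float]) -> List[float]:
    # One input sequence and one relevance score per candidate.
    return [model(build_input_sequence(system_utt, user_utt, c))
            for c in candidates]

def toy_overlap_model(tokens: List[str]) -> float:
    # Stand-in for a trained model: the fraction of candidate tokens
    # that also occur in the dialog context.
    last_sep = len(tokens) - 1 - tokens[::-1].index(SEP)
    context = {t for t in tokens[:last_sep] if t != SEP}
    candidate = tokens[last_sep + 1:]
    if not candidate:
        return 0.0
    return sum(t in context for t in candidate) / len(candidate)

system_utt = ["which", "city", "?"]
user_utt = ["paris", "please"]
candidates = [["paris"], ["tokyo"]]
scores = score_candidates(system_utt, user_utt, candidates, toy_overlap_model)
```

Passing the scorer in as a callable keeps the sequence-building step independent of the model, so the toy scorer can be swapped out without touching the rest of the sketch.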

    EXTRACTING ATTRIBUTES FROM ARBITRARY DIGITAL IMAGES UTILIZING A MULTI-ATTRIBUTE CONTRASTIVE CLASSIFICATION NEURAL NETWORK

    Publication number: US20220383037A1

    Publication date: 2022-12-01

    Application number: US17332734

    Filing date: 2021-05-27

    Applicant: Adobe Inc.

    Abstract: This disclosure describes one or more implementations of systems, non-transitory computer-readable media, and methods that extract multiple attributes from an object portrayed in a digital image utilizing a multi-attribute contrastive classification neural network. For example, the disclosed systems utilize a multi-attribute contrastive classification neural network that includes an embedding neural network, a localizer neural network, a multi-attention neural network, and a classifier neural network. In some cases, the disclosed systems train the multi-attribute contrastive classification neural network utilizing a multi-attribute, supervised-contrastive loss. In some embodiments, the disclosed systems generate negative attribute training labels for labeled digital images utilizing positive attribute labels that correspond to the labeled digital images.

    EXTRACTING ENTITY RELATIONSHIPS FROM DIGITAL DOCUMENTS UTILIZING MULTI-VIEW NEURAL NETWORKS

    Publication number: US20220138534A1

    Publication date: 2022-05-05

    Application number: US17087881

    Filing date: 2020-11-03

    Applicant: Adobe Inc.

    Abstract: This disclosure describes methods, non-transitory computer readable storage media, and systems that utilize a plurality of neural networks to determine structural and semantic information via different views of a word sequence and then utilize this information to extract a relationship between word sequence entities. For example, the disclosed systems generate a plurality of sets of encoded word representation vectors utilizing the plurality of neural networks. The disclosed systems then extract the relationship from an overall word representation vector generated based on the sets of encoded word representation vectors. Furthermore, the disclosed systems enforce structural and semantic consistency between views via a plurality of constraints involving a control mechanism for the semantic view and a plurality of losses.
