-
公开(公告)号:US20220114476A1
公开(公告)日:2022-04-14
申请号:US17070568
申请日:2020-10-14
Applicant: Adobe Inc.
Inventor: Trung Bui , Tuan Manh Lai , Quan Tran , Doo Soon Kim
IPC: G06N20/00 , G06F16/248
Abstract: This disclosure describes one or more implementations of a text sequence labeling system that accurately and efficiently utilize a joint-learning self-distillation approach to improve text sequence labeling machine-learning models. For example, in various implementations, the text sequence labeling system trains a text sequence labeling machine-learning teacher model to generate text sequence labels. The text sequence labeling system then creates and trains a text sequence labeling machine-learning student model utilizing the training and the output of the teacher model. Upon the student model achieving improved results over the teacher model, the text sequence labeling system re-initializes the teacher model with the learned model parameters of the student model and repeats the above joint-learning self-distillation framework. The text sequence labeling system then utilizes a trained text sequence labeling model to generate text sequence labels from input documents.
-
公开(公告)号:US20220100791A1
公开(公告)日:2022-03-31
申请号:US17544689
申请日:2021-12-07
Applicant: Adobe Inc.
Inventor: Ajinkya Kale , Baldo Faieta , Benjamin Leviant , Fengbin Chen , Francois Guerin , Kate Sousa , Trung Bui , Venkat Barakam , Zhe Lin
IPC: G06F16/48 , G06K9/62 , G06F16/2457 , G06F16/43
Abstract: Systems, methods, and non-transitory computer-readable media are disclosed for determining multi-term contextual tags for digital content and propagating the multi-term contextual tags to additional digital content. For instance, the disclosed systems can utilize search query supervision to determine and associate multi-term contextual tags (e.g., tags that represent a specific concept based on the order of the terms in the tag) with digital content. Furthermore, the disclosed systems can propagate the multi-term contextual tags determined for the digital content to additional digital content based on similarities between the digital content and additional digital content (e.g., utilizing clustering techniques). Additionally, the disclosed systems can provide digital content as search results based on the associated multi-term contextual tags.
-
公开(公告)号:US20220067992A1
公开(公告)日:2022-03-03
申请号:US17007693
申请日:2020-08-31
Applicant: Adobe Inc.
Inventor: Ning XU , Trung Bui , Jing Shi , Franck Dernoncourt
Abstract: This disclosure involves executing artificial intelligence models that infer image editing operations from natural language requests spoken by a user. Further, this disclosure performs the inferred image editing operations using inferred parameters for the image editing operations. Systems and methods may be provided that infer one or more image editing operations from a natural language request associated with a source image, locate areas of the source that are relevant to the one or more image editing operations to generate image masks, and performing the one or more image editing operations to generate a modified source image.
-
公开(公告)号:US20210326371A1
公开(公告)日:2021-10-21
申请号:US16849885
申请日:2020-04-15
Applicant: Adobe Inc.
Inventor: Trung Bui , Yu Gong , Tushar Dublish , Sasha Spala , Sachin Soni , Nicholas Miller , Joon Kim , Franck Dernoncourt , Carl Dockhorn , Ajinkya Kale
Abstract: Techniques and systems are described for performing semantic text searches. A semantic text-searching solution uses a machine learning system (such as a deep learning system) to determine associations between the semantic meanings of words. These associations are not limited by the spelling, syntax, grammar, or even definition of words. Instead, the associations can be based on the context in which characters, words, and/or phrases are used in relation to one another. In response to detecting a request to locate text within an electronic document associated with a keyword, the semantic text-searching solution can return strings within the document that have matching and/or related semantic meanings or contexts, in addition to exact matches (e.g., string matches) within the document. The semantic text-searching solution can then output an indication of the matching strings.
-
公开(公告)号:US20210295191A1
公开(公告)日:2021-09-23
申请号:US16825531
申请日:2020-03-20
Applicant: Adobe Inc.
Inventor: Trung Bui , Lidan Wang , Franck Dernoncourt
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for selecting hyper-parameter sets by utilizing a modified Bayesian optimization approach based on a combination of accuracy and training efficiency metrics of a machine learning model. For example, the disclosed systems can fit accuracy regression and efficiency regression models to observed metrics associated with hyper-parameter sets of a machine learning model. The disclosed systems can also implement a trade-off acquisition function that implements an accuracy-training efficiency balance metric to explore the hyper-parameter feature space and select hyper-parameters for training the machine learning model considering a balance between accuracy and training efficiency.
-
公开(公告)号:US20210216577A1
公开(公告)日:2021-07-15
申请号:US16741625
申请日:2020-01-13
Applicant: Adobe Inc.
Inventor: Jinfeng Xiao , Lidan Wang , Franck Dernoncourt , Trung Bui , Tong Sun
IPC: G06F16/33 , G06F16/953
Abstract: Techniques and systems are provided for predicting answers in response to one or more input queries. For instance, text from a corpus of text can be processed by a reader to generate one or multiple question and answer spaces. A question and answer space can include answerable questions and the answers associated with the questions (referred to as “question and answer pairs”). A query defining a question can be received (e.g., from a user input device) and processed by a retriever portion of the system. The retriever portion of the system can retrieve an answer to the question from the one or more pre-constructed question and answer spaces, and/or can determine an answer by comparing one or more answers retrieved from the one or more pre-constructed question and answer spaces to an answer generated by a retriever-reader system.
-
37.
公开(公告)号:US20210050014A1
公开(公告)日:2021-02-18
申请号:US17086805
申请日:2020-11-02
Applicant: Adobe Inc.
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating dialogue responses based on received utterances utilizing an independent gate context-dependent additive recurrent neural network. For example, the disclosed systems can utilize a neural network model to generate a dialogue history vector based on received utterances and can use the dialogue history vector to generate a dialogue response. The independent gate context-dependent additive recurrent neural network can remove local context to reduce computation complexity and allow for gates at all time steps to be computed in parallel. The independent gate context-dependent additive recurrent neural network maintains the sequential nature of a recurrent neural network using the hidden vector output.
-
公开(公告)号:US20210004576A1
公开(公告)日:2021-01-07
申请号:US17025477
申请日:2020-09-18
Applicant: Adobe Inc.
Inventor: Trung Bui , Zhe Lin , Walter Chang , Nham Le , Franck Dernoncourt
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating modified digital images based on verbal and/or gesture input by utilizing a natural language processing neural network and one or more computer vision neural networks. The disclosed systems can receive verbal input together with gesture input. The disclosed systems can further utilize a natural language processing neural network to generate a verbal command based on verbal input. The disclosed systems can select a particular computer vision neural network based on the verbal input and/or the gesture input. The disclosed systems can apply the selected computer vision neural network to identify pixels within a digital image that correspond to an object indicated by the verbal input and/or gesture input. Utilizing the identified pixels, the disclosed systems can generate a modified digital image by performing one or more editing actions indicated by the verbal input and/or gesture input.
-
39.
公开(公告)号:US20200312298A1
公开(公告)日:2020-10-01
申请号:US16366904
申请日:2019-03-27
Applicant: Adobe Inc.
Inventor: Trung Bui , Zahra Rahimi , Yinglan Ma , Seokhwan Kim , Franck Dernoncourt
IPC: G10L15/06 , G06F3/16 , G06F3/0482 , G06F3/0484 , G06F17/24 , G10L15/22 , G10L15/18 , G06N20/00
Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods that generate ground truth annotations of target utterances in digital image editing dialogues in order to create a state-driven training data set. In particular, in one or more embodiments, the disclosed systems utilize machine and user defined tags, machine learning model predictions, and user input to generate a ground truth annotation that includes frame information in addition to intent, attribute, object, and/or location information. In at least one embodiment, the disclosed systems generate ground truth annotations in conformance with an annotation ontology that results in fast and accurate digital image editing dialogue annotation.
-
40.
公开(公告)号:US20200160042A1
公开(公告)日:2020-05-21
申请号:US16192573
申请日:2018-11-15
Applicant: Adobe Inc.
Inventor: Trung Bui , Zhe Lin , Walter Chang , Nham Le , Franck Dernoncourt
Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating modified digital images based on verbal and/or gesture input by utilizing a natural language processing neural network and one or more computer vision neural networks. The disclosed systems can receive verbal input together with gesture input. The disclosed systems can further utilize a natural language processing neural network to generate a verbal command based on verbal input. The disclosed systems can select a particular computer vision neural network based on the verbal input and/or the gesture input. The disclosed systems can apply the selected computer vision neural network to identify pixels within a digital image that correspond to an object indicated by the verbal input and/or gesture input. Utilizing the identified pixels, the disclosed systems can generate a modified digital image by performing one or more editing actions indicated by the verbal input and/or gesture input.
-
-
-
-
-
-
-
-
-