SELF-SUPERVISED DOCUMENT REPRESENTATION LEARNING

    公开(公告)号:US20220382975A1

    公开(公告)日:2022-12-01

    申请号:US17333892

    申请日:2021-05-28

    Applicant: Adobe Inc.

    Abstract: One example method involves operations for a processing device that include receiving, by a machine learning model trained to generate a search result, a search query for a text input. The machine learning model is trained by receiving pre-training data that includes multiple documents. Pre-training the machine learning model by generating, using an encoder, feature embeddings for each of the documents included in the pre-training data. The feature embeddings are generated by applying a masking function to visual and textual features in the documents. Training the machine learning model also includes generating, using the feature embeddings, output features for the documents by concatenating the feature embeddings and applying a non-linear mapping to the feature embeddings. Training the machine learning model further includes applying a linear classifier to the output features. Additionally, operations include generating, for display, a search result using the machine learning model based on the input.

    OBJECT RECOGNITION AND TAGGING BASED ON FUSION DEEP LEARNING MODELS

    公开(公告)号:US20200175095A1

    公开(公告)日:2020-06-04

    申请号:US16204918

    申请日:2018-11-29

    Applicant: Adobe Inc.

    Abstract: In some embodiments, a computing system computes tags for an electronic document. The computing system identifies sets of objects for the electronic document by applying a set of object-recognition rules to the electronic document, with each object-recognition rule generating a set of identified objects. The computing system generates feature maps that represent a set of identified objects. The computing system generates a heat map that identifies attributes of the electronic document including object candidates of the electronic document by applying a page-segmentation machine-learning model to the electronic document. The computing system computes a tag by applying a fusion deep learning module to the feature map and the heat map to correlate a document object identified by the feature map with an attribute of the electronic document identified by the heat map. The computing system generates the tagged electronic document by applying the tag to the electronic document.

    Document signing and storage using data models and distributed ledgers

    公开(公告)号:US12265652B2

    公开(公告)日:2025-04-01

    申请号:US18055673

    申请日:2022-11-15

    Applicant: ADOBE INC.

    Abstract: A method includes populating a template database with templates associated with template identifiers (IDs) identifying the templates. The method also includes generating a data model that references a template within the template database, where the data model includes a template ID referencing the template in the template database, and where the template includes a parameter field. The data model further includes a template parameter to apply to the parameter field and a digital signature for at least the template ID and the template parameter. The method also includes deploying the data model within a distributed ledger.

    GENERATING AN IMPROVED NAMED ENTITY RECOGNITION MODEL USING NOISY DATA WITH A SELF-CLEANING DISCRIMINATOR MODEL

    公开(公告)号:US20250103813A1

    公开(公告)日:2025-03-27

    申请号:US18472746

    申请日:2023-09-22

    Applicant: Adobe Inc.

    Abstract: This disclosure describes one or more implementations of systems, non-transitory computer-readable media, and methods that train a named entity recognition (NER) model with noisy training data through a self-cleaning discriminator model. For example, the disclosed systems utilize a self-cleaning guided denoising framework to improve NER learning on noisy training data via a guidance training set. In one or more implementations, the disclosed systems utilize, within the denoising framework, an auxiliary discriminator model to correct noise in the noisy training data while training an NER model through the noisy training data. For example, while training the NER model to predict labels from the noisy training data, the disclosed systems utilize a discriminator model to detect noisy NER labels and reweight the noisy NER labels provided for training in the NER model.

Patent Agency Ranking