System and method for recognizing domain specific named entities using domain specific word embeddings

    公开(公告)号:US11687721B2

    公开(公告)日:2023-06-27

    申请号:US17380881

    申请日:2021-07-20

    Applicant: Intuit Inc.

    Abstract: Systems and methods for recognizing domain specific named entities are disclosed. An example method may be performed by one or more processors of a text incorporation system and include extracting a number of terms from a text under consideration, identifying, among the number of terms, a set of unmatched terms that do not match any of a plurality of known terms, passing each respective unmatched term to a vectorization module, embedding a vectorized version of each respective unmatched term in a vector space, comparing each vectorized version to known term vectors, passing, to a machine learning model, candidate terms corresponding to known term vectors closest to the vectorized versions, identifying, using the machine learning model, a best candidate term for each respective unmatched term, mapping the best candidate terms to unmatched terms in the text under consideration, and incorporating the text under consideration into the system based on the mappings.

    Compliance graph generation
    4.
    发明授权

    公开(公告)号:US11449685B2

    公开(公告)日:2022-09-20

    申请号:US16704801

    申请日:2019-12-05

    Applicant: INTUIT INC.

    Inventor: Conrad De Peuter

    Abstract: Certain aspects of the present disclosure provide techniques for generating a compliance graph based on a compliance rule to implement in a software program product for determining user compliance. To generate a compliance graph, an encoder receives a compliance rule in a source language and generates a set of corresponding vectors. The decoder, which has been trained using verified training pairs and synthetic data, generates a sequence of operations based on the vectors from the encoder. The sequence of operations is the used to build a graph in which each operation is a node in the graph and each node is connected to at least one other node in the same graph or a separate graph.

    Integrated machine learning and rules platform for improved accuracy and root cause analysis

    公开(公告)号:US11687799B1

    公开(公告)日:2023-06-27

    申请号:US17815744

    申请日:2022-07-28

    Applicant: INTUIT INC.

    CPC classification number: G06N5/025

    Abstract: Aspects of the present disclosure provide techniques for machine learning and rules integration. Embodiments include receiving input values corresponding to a subset of a set of input variables associated with an automated determination. Embodiments include generating a directed acyclic graph (DAG) representing a set of constraints corresponding to the set of input variables. The set of constraints relate to one or more machine learning models and one or more rules. Embodiments include receiving one or more outputs from the one or more machine learning models based on one or more of the input values. Embodiments include determining outcomes for the one or more rules based on at least one of the input values. Embodiments include populating the DAG based on the input values, the one or more outputs, and the outcomes. Embodiments include making the automated determination based on logic represented by the DAG.

    SYSTEM AND METHOD FOR RECOGNIZING DOMAIN SPECIFIC NAMED ENTITIES USING DOMAIN SPECIFIC WORD EMBEDDINGS

    公开(公告)号:US20210350081A1

    公开(公告)日:2021-11-11

    申请号:US17380881

    申请日:2021-07-20

    Applicant: Intuit Inc.

    Abstract: Systems and methods for recognizing domain specific named entities are disclosed. An example method may be performed by one or more processors of a text incorporation system and include extracting a number of terms from a text under consideration, identifying, among the number of terms, a set of unmatched terms that do not match any of a plurality of known terms, passing each respective unmatched term to a vectorization module, embedding a vectorized version of each respective unmatched term in a vector space, comparing each vectorized version to known term vectors, passing, to a machine learning model, candidate terms corresponding to known term vectors closest to the vectorized versions, identifying, using the machine learning model, a best candidate term for each respective unmatched term, mapping the best candidate terms to unmatched terms in the text under consideration, and incorporating the text under consideration into the system based on the mappings.

Patent Agency Ranking