PRE-SEARCH CONTENT RECOMMENDATIONS

    公开(公告)号:US20220366295A1

    公开(公告)日:2022-11-17

    申请号:US17319579

    申请日:2021-05-13

    Applicant: INTUIT INC.

    Abstract: Aspects of the present disclosure provide techniques for training a machine learning model. Embodiments include providing features of a plurality of content items as inputs to an embedding model and receiving embeddings of the plurality of content items as outputs from the embedding model. Embodiments include receiving a data set comprising features of a plurality of users associated with content items of the plurality of content items that correspond to the plurality of users. Embodiments include generating a training data set for a machine learning model, wherein the training data set comprises the features of the plurality of users associated with respective labels indicating which respective embeddings of the embeddings correspond to each respective user of the plurality of users. Embodiments include training the machine learning model, using the training data set, to output corresponding embeddings of relevant content items for users based on features of the users.

    LANGUAGE AGNOSTIC ROUTING PREDICTION FOR TEXT QUERIES

    公开(公告)号:US20230281399A1

    公开(公告)日:2023-09-07

    申请号:US17653426

    申请日:2022-03-03

    Applicant: INTUIT INC.

    CPC classification number: G06F40/58 G06F40/56 G06K9/6257

    Abstract: Embodiments disclosed herein provide language-agnostic routing prediction models. The routing prediction models input text queries in any language and generate a routing prediction for the text queries. For a language that may have sparse training text data, the models, which are machine learning models, are trained using a machine translation to a prevalent language (e.g., English) to the language having sparse training text data -with the original text corpus and the translated text corpus being an input to multi-language embedding layers. The trained machine learning model makes routing predictions for text queries for the language having sparse training text data.

Patent Agency Ranking