USING LANGUAGE MODEL TO GENERATE RECIPE WITH REFINED CONTENT

    Publication number: US20250086395A1

    Publication date: 2025-03-13

    Application number: US18244098

    Filing date: 2023-09-08

    Applicant: Maplebear Inc.

    Abstract: Embodiments relate to utilizing a language model to automatically generate a novel recipe with refined content, which can be offered to a user of an online system. The online system generates a first prompt for input into a large language model (LLM), the first prompt including a plurality of task requests for generating initial content of a recipe. The online system requests the LLM to generate, based on the first prompt input into the LLM, the initial content of the recipe. The online system generates a second prompt for input into the LLM, the second prompt including the initial content of the recipe and contextual information about the recipe. The online system requests the LLM to generate, based on the second prompt input into the LLM, refined content of the recipe. The online system stores the recipe with the refined content in a database of the online system.
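The two-prompt flow in this abstract can be sketched as follows. This is a minimal illustration, not the claimed implementation: the `llm` callable, the prompt wording, the `database` dictionary, and the `recipe_id` key are all hypothetical stand-ins for whatever model interface and storage the online system actually uses.

```python
from typing import Callable, Dict, List


def build_first_prompt(task_requests: List[str]) -> str:
    """First prompt: a numbered list of task requests for the initial recipe."""
    lines = ["Generate a recipe by completing the following tasks:"]
    lines += [f"{i}. {task}" for i, task in enumerate(task_requests, start=1)]
    return "\n".join(lines)


def build_second_prompt(initial_content: str, context: str) -> str:
    """Second prompt: the initial content plus contextual information."""
    return (
        "Refine the recipe below using the contextual information.\n\n"
        f"Recipe:\n{initial_content}\n\n"
        f"Context:\n{context}"
    )


def generate_refined_recipe(
    llm: Callable[[str], str],      # hypothetical: maps a prompt to generated text
    task_requests: List[str],
    context: str,
    database: Dict[str, str],       # hypothetical stand-in for the recipe database
    recipe_id: str,
) -> str:
    # Step 1: request the initial content from the first prompt.
    initial_content = llm(build_first_prompt(task_requests))
    # Step 2: request refined content from the second prompt.
    refined_content = llm(build_second_prompt(initial_content, context))
    # Step 3: store the recipe with the refined content.
    database[recipe_id] = refined_content
    return refined_content
```

Passing the model as a plain callable keeps the sketch independent of any particular LLM client library.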

    USING UNSUPERVISED CLUSTERING AND LANGUAGE MODEL TO NORMALIZE ATTRIBUTE TUPLES OF ITEMS IN A DATABASE

    Publication number: US20250005279A1

    Publication date: 2025-01-02

    Application number: US18215505

    Filing date: 2023-06-28

    Abstract: A computer system uses clustering and a large language model (LLM) to normalize attribute tuples for items stored in a database of an online system. The online system collects attribute tuples, each attribute tuple comprising an attribute type and an attribute value for an item. The online system initially clusters the attribute tuples into a first plurality of clusters. The online system generates prompts for input into the LLM, each prompt including a subset of attribute tuples grouped into a respective cluster of the first plurality. Based on the prompts, the LLM generates a second plurality of clusters, each cluster including one or more attribute tuples that have a common attribute type and a common attribute value. The online system maps each attribute tuple to a respective normalized attribute tuple associated with each cluster. The online system rewrites each attribute tuple in the database to a corresponding normalized attribute tuple.
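A rough sketch of the cluster-then-prompt normalization described in this abstract is shown below. Several pieces are assumptions made for illustration: the initial clustering is replaced by a simple grouping on a cleaned-up attribute type (the abstract does not name a clustering algorithm), the `llm` callable is hypothetical, and the JSON reply format is invented for this example.

```python
import json
from collections import defaultdict
from typing import Callable, Dict, List, Tuple

AttrTuple = Tuple[str, str]  # (attribute type, attribute value)


def coarse_cluster(tuples: List[AttrTuple]) -> List[List[AttrTuple]]:
    """Stand-in for the initial clustering: group by a normalized type key."""
    clusters: Dict[str, List[AttrTuple]] = defaultdict(list)
    for attr_type, attr_value in tuples:
        key = "".join(ch for ch in attr_type.lower() if ch.isalnum())
        clusters[key].append((attr_type, attr_value))
    return list(clusters.values())


def build_prompt(cluster: List[AttrTuple]) -> str:
    """One prompt per initial cluster, listing its attribute tuples."""
    listing = "\n".join(f"- {t}: {v}" for t, v in cluster)
    return (
        "Group the attribute tuples below so that each group shares one "
        "attribute type and one attribute value. Reply with a JSON list of "
        'objects with keys "normalized" ([type, value]) and "members" '
        "(list of [type, value]).\n" + listing
    )


def normalize(
    tuples: List[AttrTuple],
    llm: Callable[[str], str],  # hypothetical: returns the JSON described above
) -> Dict[AttrTuple, AttrTuple]:
    """Map each original tuple to the normalized tuple of its refined cluster."""
    mapping: Dict[AttrTuple, AttrTuple] = {}
    for cluster in coarse_cluster(tuples):
        for group in json.loads(llm(build_prompt(cluster))):
            normalized = tuple(group["normalized"])
            for member in group["members"]:
                mapping[tuple(member)] = normalized
    return mapping
```

The returned mapping is what a later step would use to rewrite each stored attribute tuple to its normalized form.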

    Generating Sponsored Content Pages Using Large Language Machine-Learned Models

    Publication number: US20250124498A1

    Publication date: 2025-04-17

    Application number: US18917136

    Filing date: 2024-10-16

    Applicant: Maplebear Inc.

    Abstract: An online system presents a sponsored content page to a user in conjunction with a model serving system. The online system accesses a content page for a food item and identifies one or more sponsorship opportunities at the content page. The online system identifies one or more candidate sponsors for each sponsorship opportunity. The online system selects a bidding sponsor for the sponsorship opportunity from the one or more candidate sponsors and a candidate item associated with the bidding sponsor as a sponsored item. The online system provides a content page, a description of the sponsored item, and a request to generate a sponsored content page for the sponsorship opportunity to a model serving system. The online system receives a sponsored content page generated by a machine-learning language model at the model serving system and presents the sponsored content page to a user.

    WEAKLY SUPERVISED EXTRACTION OF ATTRIBUTES FROM UNSTRUCTURED DATA TO GENERATE TRAINING DATA FOR MACHINE LEARNING MODELS

    Publication number: US20250117442A1

    Publication date: 2025-04-10

    Application number: US18987482

    Filing date: 2024-12-19

    Applicant: Maplebear Inc.

    Abstract: An online concierge system receives unstructured data describing items offered for purchase by various warehouses. To generate attributes for products from the unstructured data, the online concierge system extracts candidate values for attributes from the unstructured data through natural language processing. One or more users associate a subset of the candidate values with corresponding attributes, and the online concierge system clusters the remaining candidate values with the candidate values of the subset associated with attributes. One or more users provide input on the accuracy of the generated clusters. The candidate values are applied as labels to items by the online concierge system, which uses the labeled items as training data for an attribute extraction model to predict values for one or more attributes from unstructured data about an item.
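The label-propagation idea in this abstract can be illustrated with a small sketch. It is an assumption-heavy simplification: string similarity from Python's `difflib` stands in for the unspecified clustering step, the `threshold` value is arbitrary, the function names are hypothetical, and the human review of cluster accuracy is omitted.

```python
from difflib import SequenceMatcher
from typing import Dict, List, Tuple


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def propagate_labels(
    candidates: List[str],
    seeds: Dict[str, str],      # user-labeled subset: candidate value -> attribute
    threshold: float = 0.6,     # arbitrary cutoff chosen for this sketch
) -> Dict[str, str]:
    """Attach each remaining candidate to its most similar user-labeled value."""
    labels = dict(seeds)
    for value in candidates:
        if value in labels:
            continue
        best_seed = max(seeds, key=lambda seed: similarity(value, seed))
        if similarity(value, best_seed) >= threshold:
            labels[value] = seeds[best_seed]
    return labels


def label_items(
    items: Dict[str, str],      # item id -> unstructured description
    labels: Dict[str, str],
) -> List[Tuple[str, Dict[str, str]]]:
    """Apply labeled values found in each description as training labels."""
    training = []
    for item_id, text in items.items():
        lowered = text.lower()
        attrs = {
            attribute: value
            for value, attribute in labels.items()
            if value.lower() in lowered
        }
        training.append((item_id, attrs))
    return training
```

The output of `label_items` is the kind of weakly labeled data that could then be used to train an attribute extraction model.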

    ITEM ATTRIBUTE DETERMINATION USING A CO-ENGAGEMENT GRAPH

    Publication number: US20240104632A1

    Publication date: 2024-03-28

    Application number: US17935916

    Filing date: 2022-09-27

    CPC classification number: G06Q30/0635 G06Q30/0613 G06Q30/0627 G06Q30/0639

    Abstract: An online concierge system uses a co-engagement graph to assign attribute values to items for which those attribute values are uncertain. A co-engagement graph is a graph with nodes that represent items and edges that represent co-engagement between items. The online concierge system generates a co-engagement graph for a set of items based on item engagement data and item data for the items. The set of items includes items for which the online concierge system has an attribute value for a target attribute and items for which the online concierge system does not have an attribute value for the target attribute. The online concierge system identifies a node that corresponds to an unknown item and identifies a node connected to that first node that corresponds to a known item. The online concierge system assigns the attribute value for the known item to the unknown item.
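A minimal sketch of the co-engagement approach follows. It assumes that co-engagement means two items appearing in the same session, that edge weights are co-occurrence counts, and that an unknown item takes the value of its most strongly connected known neighbor; the abstract only states that a connected known node is identified, so the selection rule and all names here are illustrative.

```python
from collections import defaultdict
from itertools import combinations
from typing import Dict, List


def build_co_engagement_graph(sessions: List[List[str]]) -> Dict[str, Dict[str, int]]:
    """Nodes are items; an edge weight counts sessions containing both items."""
    graph: Dict[str, Dict[str, int]] = defaultdict(lambda: defaultdict(int))
    for items in sessions:
        for a, b in combinations(sorted(set(items)), 2):
            graph[a][b] += 1
            graph[b][a] += 1
    return graph


def assign_unknown(
    graph: Dict[str, Dict[str, int]],
    known: Dict[str, str],  # item -> value of the target attribute
) -> Dict[str, str]:
    """Give each unknown item the value of its strongest known neighbor."""
    assigned: Dict[str, str] = {}
    for item, neighbors in graph.items():
        if item in known:
            continue
        candidates = [(weight, name) for name, weight in neighbors.items() if name in known]
        if candidates:
            _, best_neighbor = max(candidates)
            assigned[item] = known[best_neighbor]
    return assigned
```

Items with no known neighbor are left unassigned rather than guessed.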

    WEAKLY SUPERVISED EXTRACTION OF ATTRIBUTES FROM UNSTRUCTURED DATA TO GENERATE TRAINING DATA FOR MACHINE LEARNING MODELS

    Publication number: US20230058829A1

    Publication date: 2023-02-23

    Application number: US17407158

    Filing date: 2021-08-19

    Abstract: An online concierge system receives unstructured data describing items offered for purchase by various warehouses. To generate attributes for products from the unstructured data, the online concierge system extracts candidate values for attributes from the unstructured data through natural language processing. One or more users associate a subset of the candidate values with corresponding attributes, and the online concierge system clusters the remaining candidate values with the candidate values of the subset associated with attributes. One or more users provide input on the accuracy of the generated clusters. The candidate values are applied as labels to items by the online concierge system, which uses the labeled items as training data for an attribute extraction model to predict values for one or more attributes from unstructured data about an item.
