-
公开(公告)号:US20250117442A1
公开(公告)日:2025-04-10
申请号:US18987482
申请日:2024-12-19
Applicant: Maplebear Inc.
Inventor: Shih-Ting Lin , Jonathan Newman , Min Xie , Haixun Wang
IPC: G06F18/23 , G06F18/21 , G06F18/214 , G06F18/22 , G06Q30/0601
Abstract: An online concierge system receives unstructured data describing items offered for purchase by various warehouses. To generate attributes for products from the unstructured data, the online concierge system extracts candidate values for attributes from the unstructured data through natural language processing. One or more users associate a subset candidate values with corresponding attributes, and the online concierge system clusters the remaining candidate values with the candidate values of the subset associated with attributes. One or more users provide input on the accuracy of the generated clusters. The candidate values are applied as labels to items by the online concierge system, which uses the labeled items as training data for an attribute extraction model to predict values for one or more attributes from unstructured data about an item.
-
公开(公告)号:US12210591B2
公开(公告)日:2025-01-28
申请号:US17407158
申请日:2021-08-19
Applicant: Maplebear Inc.
Inventor: Shih-Ting Lin , Jonathan Newman , Min Xie , Haixun Wang
IPC: G06K9/62 , G06F18/21 , G06F18/214 , G06F18/22 , G06F18/23 , G06Q30/06 , G06Q30/0601
Abstract: An online concierge system receives unstructured data describing items offered for purchase by various warehouses. To generate attributes for products from the unstructured data, the online concierge system extracts candidate values for attributes from the unstructured data through natural language processing. One or more users associate a subset candidate values with corresponding attributes, and the online concierge system clusters the remaining candidate values with the candidate values of the subset associated with attributes. One or more users provide input on the accuracy of the generated clusters. The candidate values are applied as labels to items by the online concierge system, which uses the labeled items as training data for an attribute extraction model to predict values for one or more attributes from unstructured data about an item.
-
公开(公告)号:US20230036666A1
公开(公告)日:2023-02-02
申请号:US17387943
申请日:2021-07-28
Applicant: Maplebear Inc. (dba Instacart)
Inventor: Yilena Xu , Qingyuan Chen , Laurentia Romaniuk , Jonathan Newman , Josh Roberts , Conor Woods , Lorna Qin
Abstract: An online concierge system offers items for sale and uses different units of measurements for items, allowing users to purchase numbers of an item, a weight of an item, or a packages of an item. To avoid confusing users with multiple options for specifying a quantity of an item, the online concierge system determines whether to present an interface for purchasing an item by a number of the item, by weight of the item, or a number of packages in the item from dynamic information about the item. For example, the online concierge system obtains historical pricing information an items and computes a par-weight price for the item from prior purchases of the item. The online concierge system uses the par-weight of the item to select an interface for a user when purchasing the item.
-
公开(公告)号:US11978104B2
公开(公告)日:2024-05-07
申请号:US17894839
申请日:2022-08-24
Applicant: Maplebear Inc.
Inventor: Saurav Manchanda , Min Xie , Gordon McCreight , Jonathan Newman
IPC: G06Q30/0601 , G06F16/56 , G06F16/903 , G06F40/166 , G06F40/284 , G06F40/30 , G06Q30/0201
CPC classification number: G06Q30/0629 , G06F16/56 , G06F16/90344 , G06F40/166 , G06F40/284 , G06F40/30 , G06Q30/0201
Abstract: A server receives a plurality of product data entries from a plurality of retailer computing systems. Each product data entry includes a product identifier uniquely identifying a corresponding physical product and descriptive data of the corresponding physical product. A subset of the plurality of product data entries having a same product identifier is determined. An embedding vector representative of a product data entry in the subset is pairwise compared with each of respective embedding vectors representative of other product data entries in the subset other than the product data entry to compute respective vector similarity metrics. A pooled semantic similarity metric for the product data entry based on the computed respective vector similarity metrics. It is determined whether the product data entry is an outlier in the subset based on the pooled semantic similarity metric. A notification is transmitted to a client device of a user based on the determination.
-
公开(公告)号:US20240070742A1
公开(公告)日:2024-02-29
申请号:US17894839
申请日:2022-08-24
Applicant: Maplebear Inc. (dba Instacart)
Inventor: Saurav Manchanda , Min Xie , Gordon McCreight , Jonathan Newman
IPC: G06Q30/06 , G06F16/56 , G06F16/903 , G06F40/166 , G06F40/284 , G06F40/30 , G06Q30/02
CPC classification number: G06Q30/0629 , G06F16/56 , G06F16/90344 , G06F40/166 , G06F40/284 , G06F40/30 , G06Q30/0201
Abstract: A server receives a plurality of product data entries from a plurality of retailer computing systems. Each product data entry includes a product identifier uniquely identifying a corresponding physical product and descriptive data of the corresponding physical product. A subset of the plurality of product data entries having a same product identifier is determined. An embedding vector representative of a product data entry in the subset is pairwise compared with each of respective embedding vectors representative of other product data entries in the subset other than the product data entry to compute respective vector similarity metrics. A pooled semantic similarity metric for the product data entry based on the computed respective vector similarity metrics. It is determined whether the product data entry is an outlier in the subset based on the pooled semantic similarity metric. A notification is transmitted to a client device of a user based on the determination.
-
公开(公告)号:US20230058829A1
公开(公告)日:2023-02-23
申请号:US17407158
申请日:2021-08-19
Applicant: Maplebear Inc. (dba Instacart)
Inventor: Shih-Ting Lin , Jonathan Newman , Min Xie , Haixun Wang
Abstract: An online concierge system receives unstructured data describing items offered for purchase by various warehouses. To generate attributes for products from the unstructured data, the online concierge system extracts candidate values for attributes from the unstructured data through natural language processing. One or more users associate a subset candidate values with corresponding attributes, and the online concierge system clusters the remaining candidate values with the candidate values of the subset associated with attributes. One or more users provide input on the accuracy of the generated clusters. The candidate values are applied as labels to items by the online concierge system, which uses the labeled items as training data for an attribute extraction model to predict values for one or more attributes from unstructured data about an item.
-
-
-
-
-