-
Publication No.: US20250069298A1
Publication date: 2025-02-27
Application No.: US18236346
Filing date: 2023-08-21
Applicant: Maplebear Inc.
Inventor: Prithvishankar Srinivasan, Shih-Ting Lin, Min Xie, Shishir Kumar Prasad, Yuanzheng Zhu, Katie Ann Forbes
IPC: G06T11/60, G06F16/55, G06F16/583, G06Q30/0601, G06T11/20
Abstract: An online concierge system trains a fine-tuned generative image model for distinct categories of items, starting from a generative image model that takes a textual query as input and outputs an associated image. Training of the fine-tuned generative image model is additionally based on a small set of representative images associated with the various categories, as well as textual tokens associated with the categories. Once trained, the fine-tuned generative image model can be used to generate realistic representative images for items in a database of the online concierge system that lack associated images. The fine-tuned model permits the generation of different variants of an item, such as different quantities or amounts, different packaging or packing density, and the like.
-
Publication No.: US20250037323A1
Publication date: 2025-01-30
Application No.: US18785665
Filing date: 2024-07-26
Applicant: Maplebear Inc.
Inventor: Prithvishankar Srinivasan, Shih-Ting Lin, Yuanzheng Zhu, Min Xie, Shishir Kumar Prasad, Shrikar Archak, Karuna Ahuja
IPC: G06T11/00, G06T5/70, G06V10/764
Abstract: An online system performs a task in conjunction with a model serving system or an interface system. The system generates a first prompt for input to a machine-learned language model, which specifies contextual information and a first request to generate a theme. The system provides the first prompt to the model serving system for execution by the machine-learned language model, receives a first response, and generates a second prompt. The second prompt specifies the theme and a second request to generate a third prompt for input to an image generation model, where the third prompt includes a third request to generate one or more images of one or more items associated with the theme. The system obtains the third prompt by executing the language model on the second prompt, provides the third prompt to the image generation model, and receives one or more images for presentation.
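The three-prompt chain in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name `generate_themed_images`, the prompt wording, and the two stand-in models are all hypothetical, chosen only to show how the output of each step feeds the next.

```python
from typing import Callable, List


def generate_themed_images(
    context: str,
    language_model: Callable[[str], str],
    image_model: Callable[[str], List[str]],
) -> List[str]:
    """Chain two language-model calls into one image-model call."""
    # First prompt: contextual information plus a request for a theme.
    first_prompt = f"Context: {context}\nSuggest one theme for this context."
    theme = language_model(first_prompt).strip()

    # Second prompt: the theme plus a request to write the third prompt.
    second_prompt = (
        f"Theme: {theme}\n"
        "Write a prompt for an image generation model asking for images "
        "of items associated with this theme."
    )
    third_prompt = language_model(second_prompt).strip()

    # Third prompt goes to the image generation model.
    return image_model(third_prompt)


# Hypothetical stand-ins so the sketch runs without any real model.
def fake_language_model(prompt: str) -> str:
    if prompt.startswith("Context:"):
        return "summer barbecue"
    theme = prompt.splitlines()[0][len("Theme: "):]
    return f"Generate images of items for a {theme}."


def fake_image_model(prompt: str) -> List[str]:
    return [f"<image for: {prompt}>"]


images = generate_themed_images(
    "late June, weekend", fake_language_model, fake_image_model
)
```

In a real deployment the two callables would wrap requests to a model serving system; passing them in as parameters keeps the chaining logic independent of any particular provider.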
-
Publication No.: US20240289861A1
Publication date: 2024-08-29
Application No.: US18587655
Filing date: 2024-02-26
Applicant: Maplebear Inc.
Inventor: Haixun Wang, Tejaswi Tenneti, Taesik Na, Yuanzheng Zhu, Vinesh Reddy Gudla, Lee Cohn
IPC: G06Q30/0601
CPC classification number: G06Q30/0631, G06Q30/0627, G06Q30/0635, G06Q30/0643
Abstract: Responsive to an input query from a user, an online system presents a list of recommended items that are related to the input query. The input query may be formulated as a natural language query. The online system performs an inference task in conjunction with a model serving system to generate one or more additional queries that are related to the input query or to the recommended items presented in response to it. The additional queries may be presented to the user in conjunction with the list of recommended items.
-
Publication No.: US20250086685A1
Publication date: 2025-03-13
Application No.: US18243600
Filing date: 2023-09-07
Applicant: Maplebear Inc.
Inventor: Shubhanshu Mishra, Gia Young, Jennie Morgan Burger, Joseph Olivier, Brent Luna, Yuanzheng Zhu, Mackenzie Cala, Armand Raquel-Santos, David Zandman, Mia Martinez Barnett, Callie Bleckner, Mohammad Abdul-Rahim
IPC: G06Q30/0601, G06V20/50
Abstract: An online concierge system assists users in identifying additional information about items in an image. Image regions that may correspond to unknown items are identified in the image, and an item search space for detecting items in those regions is determined from the context of the image, such as the items in a warehouse or a list of items delivered to a customer. The identified items are used to retrieve relevant item information, which is included in a prompt for a language model to extract relevant information for the item. As such, the process may automatically convert the image into relevant textual information about the pictured items. Applications include assisting vision-impaired users in distinguishing delivered items, or quickly identifying and evaluating relevant information about items.
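The search-space restriction and prompt assembly described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented method: region detection is out of scope, so `region_labels` stands for candidate item names produced by some hypothetical detector, and `build_item_prompt`, the prompt layout, and the sample data are invented for the example.

```python
from typing import Dict, List


def build_item_prompt(
    region_labels: List[str],
    search_space: List[str],
    item_info: Dict[str, str],
    question: str,
) -> str:
    """Keep only candidates inside the context-derived search space,
    then assemble a language-model prompt from their stored information."""
    allowed = set(search_space)
    # Discard detections that cannot be in the image given its context.
    identified = [label for label in region_labels if label in allowed]
    info_lines = [
        f"- {name}: {item_info.get(name, 'no information on file')}"
        for name in identified
    ]
    return (
        "Items in the image:\n"
        + "\n".join(info_lines)
        + f"\n\nRequest: {question}"
    )


# Hypothetical context: the list of items delivered to the customer.
delivered_items = ["oat milk", "whole milk"]
item_info = {
    "oat milk": "dairy-free, 64 fl oz",
    "whole milk": "contains dairy, 64 fl oz",
}
prompt = build_item_prompt(
    ["oat milk", "coffee mug", "whole milk"],
    delivered_items,
    item_info,
    "Which carton is dairy-free?",
)
```

Here the "coffee mug" candidate is dropped because it is not in the delivered order, so the prompt carries information only for the two items the context allows.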
-