Invention Grant
- Patent Title: Multiple model injection for a deployment cluster
-
Application No.: US16809414Application Date: 2020-03-04
-
Publication No.: US11206316B2Publication Date: 2021-12-21
- Inventor: Kartik Mathur
- Applicant: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
- Applicant Address: US TX Houston
- Assignee: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
- Current Assignee: HEWLETT PACKARD ENTERPRISE DEVELOPMENT LP
- Current Assignee Address: US TX Houston
- Agency: Sheppard Mullin Richter & Hampton LLP
- Main IPC: H04L29/08
- IPC: H04L29/08 ; G06F9/455 ; G06N5/04

Abstract:
Systems and methods are provided for servicing inference request by one of multiple machine learning models attached to a deployment cluster. The API server of a deployment cluster is not tightly coupled to any of multiple machine learning models attached to the deployment cluster. Upon receiving an inference request, the deployment cluster can retrieve the configuration parameters, including serialization formatting, for a target model identified in the inference request. The deployment cluster can utilize the retrieved parameters to service the inference request and return the results to a business system application.
Public/Granted literature
- US20210281662A1 MULTIPLE MODEL INJECTION FOR A DEPLOYMENT CLUSTER Public/Granted day:2021-09-09
Information query