Multiple model injection for a deployment cluster
Abstract:
Systems and methods are provided for servicing an inference request with one of multiple machine learning models attached to a deployment cluster. The deployment cluster's API server is not tightly coupled to any of the machine learning models attached to the cluster. Upon receiving an inference request, the deployment cluster can retrieve the configuration parameters, including the serialization format, for the target model identified in the request. The deployment cluster can use the retrieved parameters to service the inference request and return the results to a business system application.
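A minimal sketch of the decoupled lookup the abstract describes, assuming a hypothetical Python API server. The registry, ModelConfig, and handle_inference_request names are illustrative and not taken from the patent; the point is that per-model configuration (including serialization format) is resolved at request time, so the handler is not tied to any one model.

```python
import json
import pickle
from dataclasses import dataclass
from typing import Any, Callable, Dict

# Hypothetical per-model configuration: each model attached to the
# cluster carries its own serialization format, keeping the API
# server decoupled from any particular model.
@dataclass
class ModelConfig:
    predict: Callable[[Any], Any]  # the model's inference callable
    serialization: str             # e.g. "json" or "pickle"

# Illustrative registry of models attached to the deployment cluster.
MODEL_REGISTRY: Dict[str, ModelConfig] = {
    "sentiment-v2": ModelConfig(
        predict=lambda features: {"label": "positive", "score": 0.93},
        serialization="json",
    ),
    "churn-v1": ModelConfig(
        predict=lambda features: {"churn_probability": 0.12},
        serialization="pickle",
    ),
}

def _deserialize(payload: bytes, fmt: str) -> Any:
    if fmt == "json":
        return json.loads(payload)
    if fmt == "pickle":
        return pickle.loads(payload)
    raise ValueError(f"unsupported serialization format: {fmt}")

def _serialize(result: Any, fmt: str) -> bytes:
    if fmt == "json":
        return json.dumps(result).encode()
    if fmt == "pickle":
        return pickle.dumps(result)
    raise ValueError(f"unsupported serialization format: {fmt}")

def handle_inference_request(target_model: str, payload: bytes) -> bytes:
    """Service an inference request against the model named in the request.

    Configuration is looked up at request time, so attaching or swapping
    models requires no change to this handler.
    """
    config = MODEL_REGISTRY[target_model]
    features = _deserialize(payload, config.serialization)
    result = config.predict(features)
    return _serialize(result, config.serialization)

if __name__ == "__main__":
    request_body = json.dumps({"text": "great product"}).encode()
    response = handle_inference_request("sentiment-v2", request_body)
    print(response.decode())  # {"label": "positive", "score": 0.93}
```

In this sketch, adding a new model is a registry entry rather than a code change to the server, which mirrors the abstract's claim that the API server is not tightly coupled to any attached model.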