- Patent Title: Service level agreement-based multi-hardware accelerated inference
- Application No.: US17066400
- Application Date: 2020-10-08
- Publication No.: US11356339B2
- Publication Date: 2022-06-07
- Inventor: Francesc Guim Bernat, Kshitij Arun Doshi, Suraj Prabhakaran, Raghu Kondapalli, Alexander Bachmutsky
- Applicant: Intel Corporation
- Applicant Address: US CA Santa Clara
- Assignee: Intel Corporation
- Current Assignee: Intel Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: Schwegman Lundberg & Woessner, P.A.
- Main IPC: H04L29/08
- IPC: H04L29/08; G06F17/30; G06F11/34; G06F11/30; H04L41/5019; H04L67/12; H04L67/63; H04L67/61; H04L41/0806; H04L41/5041; G06N5/04

Abstract:
Various systems and methods for implementing a service-level agreement (SLA) apparatus are described. The apparatus receives a request from a requester via a network interface of a gateway. The request comprises an inference model identifier that identifies a handler of the request, and a response time indicator. The response time indicator either relates to a time within which the request is to be handled or indicates an undefined time within which the request is to be handled. The apparatus determines a network location of a handler that is a platform or an inference model to handle the request consistent with the response time indicator, and routes the request to the handler at the network location.
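
The selection step summarized in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the `Handler` and `Request` types, the `expected_latency_ms` field, the use of `None` for an undefined response time, and the "pick the fastest eligible handler" policy are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Handler:
    """A platform or inference model at a network location (hypothetical type)."""
    model_id: str
    location: str               # e.g. "host:port"; the format is an assumption
    expected_latency_ms: float  # assumed to be known to the gateway


@dataclass(frozen=True)
class Request:
    """A request as described in the abstract (hypothetical type)."""
    model_id: str                      # inference model identifier
    response_time_ms: Optional[float]  # None models the "undefined time" case


def select_handler(request: Request, handlers: List[Handler]) -> Optional[Handler]:
    """Return a handler consistent with the request, or None if there is none."""
    # Keep only handlers that serve the requested inference model.
    candidates = [h for h in handlers if h.model_id == request.model_id]
    # If a response time is defined, keep only handlers expected to meet it.
    if request.response_time_ms is not None:
        candidates = [
            h for h in candidates
            if h.expected_latency_ms <= request.response_time_ms
        ]
    if not candidates:
        return None
    # Assumed policy: route to the fastest remaining handler.
    return min(candidates, key=lambda h: h.expected_latency_ms)
```

A gateway built along these lines would forward the request to `select_handler(...).location` when a handler is returned, and would need its own policy (reject, queue, or relax the deadline) when the result is `None`.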
Public/Granted literature
- US20210099362A1 SERVICE LEVEL AGREEMENT-BASED MULTI-HARDWARE ACCELERATED INFERENCE Public/Granted day: 2021-04-01