Service level agreement-based multi-hardware accelerated inference

Invention Grant

US11356339B2 Service level agreement-based multi-hardware accelerated inference 有权

Please log in to see more content

Patent Title: Service level agreement-based multi-hardware accelerated inference
Application No.: US17066400

Application Date: 2020-10-08
Publication No.: US11356339B2

Publication Date: 2022-06-07
Inventor: Francesc Guim Bernat , Kshitij Arun Doshi , Suraj Prabhakaran , Raghu Kondapalli , Alexander Bachmutsky
Applicant: Intel Corporation
Applicant Address: US CA Santa Clara
Assignee: Intel Corporation
Current Assignee: Intel Corporation
Current Assignee Address: US CA Santa Clara
Agency: Schwegman Lundberg & Woessner, P.A.
Main IPC: H04L29/08
IPC: H04L29/08 ; G06F17/30 ; G06F11/34 ; G06F11/30 ; H04L41/5019 ; H04L67/12 ; H04L67/63 ; H04L67/61 ; H04L41/0806 ; H04L41/5041 ; G06N5/04

Service level agreement-based multi-hardware accelerated inference

Abstract:

Various systems and methods for implementing a service-level agreement (SLA) apparatus receive a request from a requester via a network interface of the gateway, the request comprising an inference model identifier that identifies a handler of the request, and a response time indicator. The response time indicator relates to a time within which the request is to be handled indicates an undefined time within which the request is to be handled. The apparatus determines a network location of a handler that is a platform or an inference model to handle the request consistent with the response time indicator, and routes the request to the handler at the network location.

Public/Granted literature

US20210099362A1 SERVICE LEVEL AGREEMENT-BASED MULTI-HARDWARE ACCELERATED INFERENCE Public/Granted day:2021-04-01

Information query

Espacenet