Dynamic kernel slicing for VGPU sharing in serverless computing systems

Invention Grant

US11113782B2 Dynamic kernel slicing for VGPU sharing in serverless computing systems (Active)

Patent Title: Dynamic kernel slicing for VGPU sharing in serverless computing systems
Application No.: US16601831

Application Date: 2019-10-15
Publication No.: US11113782B2

Publication Date: 2021-09-07
Inventor: Chandra Prakash, Anshuj Garg, Uday Pundalik Kurkure, Hari Sivaraman, Lan Vu, Sairam Veeraswamy
Applicant: VMware, Inc.
Applicant Address: US CA Palo Alto
Assignee: VMware, Inc.
Current Assignee: VMware, Inc.
Current Assignee Address: US CA Palo Alto
Agency: Thomas | Horstemeyer, LLP
Main IPC: G06T1/20
IPC: G06T1/20; G06F9/455; G06F9/48

Abstract:

Various examples are disclosed for dynamic kernel slicing for virtual graphics processing unit (vGPU) sharing in serverless computing systems. A computing device is configured to provide a serverless computing service, receive a request for execution of program code in the serverless computing service in which a plurality of virtual graphics processing units (vGPUs) are used in the execution of the program code, determine a slice size to partition a compute kernel of the program code into a plurality of sub-kernels for concurrent execution by the vGPUs, the slice size being determined for individual ones of the sub-kernels based on an optimization function that considers a load on a GPU, determine an execution schedule for executing the individual ones of the sub-kernels on the vGPUs in accordance with a scheduling policy, and execute the sub-kernels on the vGPUs as partitioned in accordance with the execution schedule.
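
The slicing and scheduling steps named in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the patented method: the names `SubKernel`, `choose_slice_size`, and `slice_and_schedule` are hypothetical, the "optimization function" is replaced by a simple rule that shrinks the slice as load rises, the load is tracked per vGPU as a fraction between 0 and 1, and the scheduling policy is plain round-robin.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SubKernel:
    """One slice of the original compute kernel (hypothetical structure)."""
    first_block: int   # index of the first thread block in this slice
    num_blocks: int    # slice size, in thread blocks
    vgpu: int          # index of the vGPU the slice is assigned to


def choose_slice_size(remaining_blocks: int, gpu_load: float, max_slice: int) -> int:
    """Stand-in for the optimization function (an assumption, not the patent's).

    The slice shrinks linearly as the load (0.0 = idle, 1.0 = saturated)
    rises, but never drops below one thread block.
    """
    free_fraction = max(0.0, 1.0 - gpu_load)
    size = max(1, int(max_slice * free_fraction))
    return min(size, remaining_blocks)


def slice_and_schedule(total_blocks: int, vgpu_loads: List[float],
                       max_slice: int) -> List[SubKernel]:
    """Partition a kernel into sub-kernels and assign them round-robin."""
    schedule: List[SubKernel] = []
    next_block = 0
    vgpu = 0
    while next_block < total_blocks:
        # Slice size is decided per sub-kernel, from the target's current load.
        size = choose_slice_size(total_blocks - next_block,
                                 vgpu_loads[vgpu], max_slice)
        schedule.append(SubKernel(next_block, size, vgpu))
        next_block += size
        vgpu = (vgpu + 1) % len(vgpu_loads)  # round-robin scheduling policy
    return schedule


if __name__ == "__main__":
    # A 100-block kernel, one idle vGPU and one half-loaded vGPU.
    for sub in slice_and_schedule(100, [0.0, 0.5], max_slice=32):
        print(sub)
```

In this sketch the half-loaded vGPU receives slices half the size of those sent to the idle one, and the final slice is truncated to whatever blocks remain. A real system would presumably refresh the load measurements between slices and use a richer cost model.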

Published/granted documents

US20210110506A1 DYNAMIC KERNEL SLICING FOR VGPU SHARING IN SERVERLESS COMPUTING SYSTEMS Publication/grant date: 2021-04-15

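IPC classification:

G	Physics
G06	Computing; calculating or counting
G06T	Image data processing or generation, in general
G06T1/00	General purpose image data processing
G06T1/20	. Processor architectures; processor configuration, e.g. pipelining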