Invention Grant
- Patent Title: Dynamically provisioning and scaling graphic processing units for data analytic workloads in a hardware cloud
-
Application No.: US15093965Application Date: 2016-04-08
-
Publication No.: US09916636B2Publication Date: 2018-03-13
- Inventor: Min Li , John Alan Bivens , Koushik K. Das , Ruchi Mahindru , Harigovind V. Ramasamy , Yaoping Ruan , Valentina Salapura , Eugen Schenfeld
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Jeffrey S. LaBaw; David H. Judson
- Main IPC: G06F9/50
- IPC: G06F9/50 ; G06T1/20 ; G06T15/00

Abstract:
Server resources in a data center are disaggregated into shared server resource pools, including a graphics processing unit (GPU) pool. Servers are constructed dynamically, on-demand and based on workload requirements, by allocating from these resource pools. According to this disclosure, GPU utilization in the data center is managed proactively by assigning GPUs to workloads in a fine granularity and agile way, and de-provisioning them when no longer needed. In this manner, the approach is especially advantageous to automatically provision GPUs for data analytic workloads. The approach thus provides for a “micro-service” enabling data analytic workloads to automatically and transparently use GPU resources without providing (e.g., to the data center customer) the underlying provisioning details. Preferably, the approach dynamically determines the number and the type of GPUs to use, and then during runtime auto-scales the GPUs based on workload.
Public/Granted literature
Information query