Dynamic GPU-enabled virtual machine provisioning across cloud providers
Abstract:
A method of provisioning virtual machines (VMs) includes: providing a VM pool that includes a graphics processing unit (GPU)-optimized VM and a non-GPU-optimized VM operating in different clouds. A control plane can receive an indication that a user has submitted a workload request, determine whether a GPU-optimized VM is available and instruct the non-GPU-optimized VM to send the workload to the GPU-optimized VM in a peer-to-peer manner. The GPU-optimized VM computes the workload and returns a result to the requesting VM. The control plane can instantiate a new GPU-optimized VM (or terminate it when the workload is complete) to dynamically maintain a desired number of available GPU-optimized VMs.
Information query
Patent Agency Ranking
0/0