- Patent Title: Automatically scaling compute resources for heterogeneous workloads
- Application No.: US16199014
- Application Date: 2018-11-23
- Publication No.: US10761893B1
- Publication Date: 2020-09-01
- Inventor: Vivek Bhadauria, Praveenkumar Udayakumar, Jonathan Andrew Hedley, Vasant Manohar, Andrea Olgiati, Rakesh Madhavan Nambiar, Gowtham Jeyabalan, Shubham Chandra Gupta, Palak Mehta
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Nicholson De Vos Webster & Elliott LLP
- Main IPC: G06F9/46
- IPC: G06F9/46; G06F9/50

Abstract:
Techniques are described for automatically scaling (or “auto scaling”) compute resources—for example, virtual machine (VM) instances, containers, or standalone servers—used to support execution of service-oriented software applications and other types of applications that may process heterogeneous workloads. The resource requirements for a software application can be approximated by measuring “worker pool” utilization of instances of each service, where a worker pool represents a number of requests that the service can process concurrently. A scaling service can thus be configured to scale the compute instances provisioned for a service in proportion to worker pool utilization, that is, compute instances can be added as the fleet's worker pools become more “busy,” while compute instances can be removed when worker pools become inactive.
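
The proportional scaling idea in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the `InstanceStats` type, the `target_utilization` setpoint, and the `min_instances`/`max_instances` bounds are all hypothetical names and parameters introduced here for illustration. The sketch assumes each instance reports how many of its worker-pool slots are busy, and it sizes the fleet so that aggregate worker pool utilization moves toward the chosen target.

```python
import math
from dataclasses import dataclass


@dataclass
class InstanceStats:
    """Hypothetical per-instance report (not defined in the source)."""
    busy_workers: int  # requests the instance is processing right now
    pool_size: int     # requests the instance can process concurrently


def fleet_utilization(fleet):
    """Fraction of all worker-pool slots in the fleet that are busy."""
    total_slots = sum(i.pool_size for i in fleet)
    if total_slots == 0:
        return 0.0
    return sum(i.busy_workers for i in fleet) / total_slots


def desired_instance_count(fleet, target_utilization=0.5,
                           min_instances=1, max_instances=100):
    """Scale the fleet in proportion to worker pool utilization.

    Assumption: a simple target-tracking rule, where the new size is
    current_size * utilization / target, rounded up and clamped.
    """
    if not 0 < target_utilization <= 1:
        raise ValueError("target_utilization must be in (0, 1]")
    if not fleet:
        return min_instances
    raw = len(fleet) * fleet_utilization(fleet) / target_utilization
    return max(min_instances, min(max_instances, math.ceil(raw)))


# Busy fleet: 4 instances, 8 of 10 workers busy on each (80% utilization).
busy_fleet = [InstanceStats(busy_workers=8, pool_size=10) for _ in range(4)]
# Mostly idle fleet: 1 of 10 workers busy on each (10% utilization).
idle_fleet = [InstanceStats(busy_workers=1, pool_size=10) for _ in range(4)]

print(desired_instance_count(busy_fleet))  # scale out: 7
print(desired_instance_count(idle_fleet))  # scale in: 1
```

With a 50% target, the busy fleet grows from 4 to 7 instances and the mostly idle fleet shrinks to the minimum of 1, matching the abstract's description of adding instances as worker pools become busier and removing them as pools become inactive.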