Optimizing tail latency via workload and resource redundancy in cloud
Abstract:
A plurality of requests are received for computing processing. At least some of the plurality of requests are replicated. The requests are replicated based on a fractional replication factor. Each received request and each replicated request are transmitted to a computer resource for processing. At least some embodiments provide the capability for meeting tail latency targets with improved performance and reduced cost.
Information query
Patent Agency Ranking
0/0