Using tiered storage and ISTIO to satisfy SLA in model serving and updates
Abstract:
Systems, methods, and computer-readable storage media are provided for storing machine-learned models in tiered storage. The model serving network evaluates where each model should be stored based on the model's corresponding service level agreement. A model is generally stored on the lowest-tier storage device that is still capable of satisfying the model's service level agreement. In this way, the model serving network aims to store models at the lowest cost that still meets their service level agreements.
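The placement rule described in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that a service level agreement reduces to a maximum read latency and that each tier has a known cost and latency; the tier names, the numbers, and the `pick_tier` helper are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Tier:
    """A storage tier (hypothetical fields, for illustration only)."""
    name: str
    cost_per_gb: float       # assumed relative cost of storing a model here
    read_latency_ms: float   # assumed time to read a model from this tier


def pick_tier(tiers: Sequence[Tier], sla_latency_ms: float) -> Optional[Tier]:
    """Return the cheapest tier whose latency still meets the SLA.

    Returns None when no tier can satisfy the SLA.
    """
    eligible = [t for t in tiers if t.read_latency_ms <= sla_latency_ms]
    return min(eligible, key=lambda t: t.cost_per_gb, default=None)


# Illustrative tiers, ordered from highest (fast, costly) to lowest (slow, cheap).
TIERS = [
    Tier("memory", cost_per_gb=5.0, read_latency_ms=1.0),
    Tier("ssd", cost_per_gb=0.5, read_latency_ms=20.0),
    Tier("object_store", cost_per_gb=0.02, read_latency_ms=500.0),
]

if __name__ == "__main__":
    # A model with a 50 ms SLA cannot use the object store, so it lands on SSD.
    print(pick_tier(TIERS, sla_latency_ms=50.0))
```

Under these assumptions, a model with a loose SLA falls through to the cheapest tier, while a model with a strict SLA stays on a higher tier; a real system would likely weigh further factors such as model size and update frequency.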