Machine learning to improve caching efficiency in a storage system

Invention Grant

US11693570B2 Machine learning to improve caching efficiency in a storage system (Active)

Patent Title: Machine learning to improve caching efficiency in a storage system
Application No.: US17243973

Application Date: 2021-04-29
Publication No.: US11693570B2

Publication Date: 2023-07-04
Inventor: Vamsi Vankamamidi, Shaul Dar
Applicant: EMC IP Holding Company LLC
Applicant Address: US MA Hopkinton
Assignee: EMC IP Holding Company LLC
Current Assignee: EMC IP Holding Company LLC
Current Assignee Address: US MA Hopkinton
Agency: Daly, Crowley, Mofford & Durkee LLP
Main IPC: G06F3/06
IPC: G06F3/06; G06F12/0891

Abstract:

A system and method improve caching efficiency in a data storage system by performing machine learning processes on metadata relating to extents of data blocks, rather than individual blocks themselves. Thus, once the storage devices are divided into extents, various metadata regarding access to the blocks within each extent are aggregated, and per-extent features are extracted. These features are used to train a data regression model that is subsequently used to infer a most likely “hotness” value for each extent at a future time. These predicted values, which may be further classified as e.g. “hot”, “warm”, and “cold” using thresholds, are used to implement the cache replacement policy. Embodiments scale to large and multi-layered caches, and may avoid common caching problems like thrashing, by adjusting the extent size. Policy goal functions may be optimized by dynamically adjusting the classification thresholds.
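The pipeline the abstract describes (aggregate block accesses per extent, extract a feature, fit a regression model, predict future hotness, then classify by thresholds) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the extent size of 4 blocks, the single feature (access count in the previous time window), the one-variable least-squares fit standing in for the regression model, the threshold values, and all function names.

```python
from collections import Counter

EXTENT_BLOCKS = 4  # assumed extent size, in blocks (hypothetical value)


def extent_counts(block_accesses, extent_blocks=EXTENT_BLOCKS):
    """Aggregate per-block accesses into per-extent access counts."""
    return Counter(block // extent_blocks for block in block_accesses)


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept (one feature)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    var_x = sum((x - mean_x) ** 2 for x in xs)
    if var_x == 0:  # all extents look alike: fall back to the mean
        return 0.0, mean_y
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = cov / var_x
    return slope, mean_y - slope * mean_x


def classify(hotness, warm_at=2.0, hot_at=5.0):
    """Map a predicted hotness value to a class using assumed thresholds."""
    if hotness >= hot_at:
        return "hot"
    if hotness >= warm_at:
        return "warm"
    return "cold"


def predict_labels(prev_window, curr_window, extents):
    """Train on prev -> curr counts, then predict the next window per extent."""
    prev = extent_counts(prev_window)
    curr = extent_counts(curr_window)
    slope, intercept = fit_line([prev[e] for e in extents],
                                [curr[e] for e in extents])
    return {e: classify(slope * curr[e] + intercept) for e in extents}


# Toy access traces (block numbers); extents 0, 1, 2 cover blocks 0-3, 4-7, 8-11.
window0 = [0, 1, 2, 3, 0, 1, 4, 5, 8]
window1 = [0, 1, 2, 0, 1, 3, 2, 4, 5, 6]
labels = predict_labels(window0, window1, extents=[0, 1, 2])
print(labels)  # {0: 'hot', 1: 'warm', 2: 'cold'}
```

In this sketch a cache replacement policy would evict extents labeled "cold" first. Changing `EXTENT_BLOCKS` or the `warm_at` and `hot_at` thresholds corresponds loosely to the extent-size and threshold adjustments the abstract mentions.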

Public/Granted literature

US20220350484A1 MACHINE LEARNING TO IMPROVE CACHING EFFICIENCY IN A STORAGE SYSTEM Public/Granted day: 2022-11-03

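IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F3/00	Input arrangements for transferring data to be processed into a form capable of being handled by the computer; output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
G06F3/06	.Digital input from, or digital output to, record carriers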