Data storage system, process, and computer program for de-duplication of distributed data in a scalable cluster system

Invention Grant

US10929042B2 Data storage system, process, and computer program for de-duplication of distributed data in a scalable cluster system 有权

Please log in to see more content

Patent Title: Data storage system, process, and computer program for de-duplication of distributed data in a scalable cluster system
Application No.: US16304727

Application Date: 2016-10-20
Publication No.: US10929042B2

Publication Date: 2021-02-23
Inventor: Yuko Matsui , Mitsuo Hayasaka , Christopher James Aston , Jonathan Smith , Daniel Picken , James Gibbs , Simon Crosland
Applicant: HITACHI, LTD.
Applicant Address: JP Tokyo
Assignee: HITACHI, LTD.
Current Assignee: HITACHI, LTD.
Current Assignee Address: JP Tokyo
Agency: Mattingly & Malur, PC
International Application: PCT/US2016/057849 WO 20161020
International Announcement: WO2018/075042 WO 20180426
Main IPC: G06F16/00
IPC: G06F16/00 ; G06F3/06 ; G06F16/215

Data storage system, process, and computer program for de-duplication of distributed data in a scalable cluster system

Abstract:

A data de-duplication in a distributed storage of data objects in a cluster system, in which plural data objects are distributed across a group of node apparatuses and stored in units of data blocks. Each metadata structure including a root metadata node and one or more direct metadata nodes, and optionally including one or more indirect metadata nodes; and a metadata object is stored for managing de-duplicated data blocks based on a metadata structure of the metadata object wherein at least one direct metadata node of the metadata structure of the metadata object includes a block reference pointing to a de-duplicated data block being associated with two or more data objects. Preferably, each of the metadata structures of the two or more data objects being associated with the de-duplicated data block includes a respective direct metadata node including an object reference to the metadata structure of the metadata object.

Public/Granted literature

US20190278504A1 DATA STORAGE SYSTEM, PROCESS, AND COMPUTER PROGRAM FOR DE-DUPLICATION OF DISTRIBUTED DATA IN A SCALABLE CLUSTER SYSTEM Public/Granted day:2019-09-12

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F16/00	信息检索；数据库结构；文件系统结构