Invention Grant
US09110936B2 Using index partitioning and reconciliation for data deduplication
有权
使用索引分区和对帐进行重复数据删除
- Patent Title: Using index partitioning and reconciliation for data deduplication
- Patent Title (中): 使用索引分区和对帐进行重复数据删除
-
Application No.: US12979748Application Date: 2010-12-28
-
Publication No.: US09110936B2Publication Date: 2015-08-18
- Inventor: Jin Li , Sudipta Sengupta , Ran Kalach , Ronakkumar N. Desai , Paul Adrian Oltean , James Robert Benton
- Applicant: Jin Li , Sudipta Sengupta , Ran Kalach , Ronakkumar N. Desai , Paul Adrian Oltean , James Robert Benton
- Applicant Address: US WA Redmond
- Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
- Current Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
- Current Assignee Address: US WA Redmond
- Agent Henry Gabryjelski; Stein Dolan; Micky Minhas
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
The subject disclosure is directed towards a data deduplication technology in which a hash index service's index is partitioned into subspace indexes, with less than the entire hash index service's index cached to save memory. The subspace index is accessed to determine whether a data chunk already exists or needs to be indexed and stored. The index may be divided into subspaces based on criteria associated with the data to index, such as file type, data type, time of last usage, and so on. Also described is subspace reconciliation, in which duplicate entries in subspaces are detected so as to remove entries and chunks from the deduplication system. Subspace reconciliation may be performed at off-peak time, when more system resources are available, and may be interrupted if resources are needed. Subspaces to reconcile may be based on similarity, including via similarity of signatures that each compactly represents the subspace's hashes.
Public/Granted literature
- US20120166401A1 Using Index Partitioning and Reconciliation for Data Deduplication Public/Granted day:2012-06-28
Information query