Invention Grant
US09547662B2 Digest retrieval based on similarity search in data deduplication
有权
基于重复数据删除中的相似搜索的摘要检索
- Patent Title: Digest retrieval based on similarity search in data deduplication
- Patent Title (中): 基于重复数据删除中的相似搜索的摘要检索
-
Application No.: US13839581Application Date: 2013-03-15
-
Publication No.: US09547662B2Publication Date: 2017-01-17
- Inventor: Shay H. Akirav , Lior Aronovich , Shira Ben-Dor , Michael Hirsch , Ofer Leneman
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Griffiths & Seaton PLLC
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
For digest retrieval based on similarity search in deduplication processing in a data deduplication system using a processor device in a computing environment, input data is partitioned into fixed sized data chunks. Similarity elements and digest block boundaries and digest values are calculated for each of the fixed sized data chunks. Matching similarity elements are searched for in a search structure containing the similarity elements for each of the fixed sized data chunks in a repository of data. Positions of similar data are located in the repository. The positions of the similar data are used to locate and load into the memory stored digest values and corresponding stored digest block boundaries of the similar data in the repository. The digest values and the corresponding digest block boundaries of the input data are matched with the stored digest values and the corresponding stored digest block boundaries to find data matches.
Public/Granted literature
- US20140279951A1 DIGEST RETRIEVAL BASED ON SIMILARITY SEARCH IN DATA DEDUPLICATION Public/Granted day:2014-09-18
Information query