Invention Grant
- Patent Title: Sampling based data de-duplication
- Patent Title (中): 基于抽样的数据重复数据删除
-
Application No.: US13351192Application Date: 2012-01-16
-
Publication No.: US08442956B2Publication Date: 2013-05-14
- Inventor: Jeffrey Vincent Tofano
- Applicant: Jeffrey Vincent Tofano
- Applicant Address: US CA Santa Monica
- Assignee: Wells Fargo Capital Finance, LLC
- Current Assignee: Wells Fargo Capital Finance, LLC
- Current Assignee Address: US CA Santa Monica
- Agency: Ladas & Parry, LLP
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Example apparatus, methods, and computers perform sampling based data de-duplication. One example method controls a data de-duplication computer to compute a sampling sequence for a sub-block of data and to use the sampling sequence to locate a stored sub-block known to the data de-duplication computer. Upon finding a stored sub-block to compare to, the method includes controlling the data de-duplication computer to determine a degree of similarity (e.g., duplicate, very similar, somewhat similar, very dissimilar, completely dissimilar, x % similar) between the sub-block and the stored sub-block and to control whether and how the sub-block is stored and/or transmitted based on the degree of similarity. The degree of similarity can also control whether and how the data de-duplication computer updates a dedupe data structure(s) that stores information for finding groups of similarity sampling sequence related sub-blocks.
Public/Granted literature
- US20120233135A1 SAMPLING BASED DATA DE-DUPLICATION Public/Granted day:2012-09-13
Information query