Invention Grant
- Patent Title: Deduplication ratio estimation using an expandable basis set
-
Application No.: US15600880Application Date: 2017-05-22
-
Publication No.: US10740296B2Publication Date: 2020-08-11
- Inventor: Danny Harnik , Ronen I. Kat , Ety Khaitzin , Sergey Marenkov
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Alexander G. Jochym
- Main IPC: G06F16/174
- IPC: G06F16/174 ; G06F16/23 ; G06F9/455 ; G06F9/451

Abstract:
A computer-implemented method includes receiving a set of basis fingerprints corresponding to image chunks within a basis set of image regions wherein each image region within the basis set of image regions comprises one or more image chunks, and generating a fingerprint for each image chunk of a plurality of selected image chunks within an unprocessed region of a machine image to produce a plurality of sampled fingerprints. The method also includes determining a similarity metric for the unprocessed region from the sampled fingerprints and the basis fingerprints, comparing the similarity metric for the unprocessed region with a selected threshold, and including the unprocessed region within the basis set of image regions in response to determining that the similarity metric is less than the selected threshold. A corresponding computer program product and computer system are also disclosed herein.
Public/Granted literature
- US20170262468A1 DEDUPLICATION RATIO ESTIMATION USING AN EXPANDABLE BASIS SET Public/Granted day:2017-09-14
Information query