Invention Grant
- Patent Title: Selecting candidate rows for deduplication
- Patent Title (中): 选择重复数据删除的候选行
-
Application No.: US13593508Application Date: 2012-08-23
-
Publication No.: US08719236B2Publication Date: 2014-05-06
- Inventor: Yaron Zinar , Efim Hudis , Yifat Orlin , Gal Novik , Yuri Gurevich , Gad Peleg
- Applicant: Yaron Zinar , Efim Hudis , Yifat Orlin , Gal Novik , Yuri Gurevich , Gad Peleg
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agent Nicholas Chen; Kate Drakos; Micky Minhas
- Main IPC: G06F17/30
- IPC: G06F17/30 ; G06F3/06

Abstract:
The present invention extends to methods, systems, and computer program products for selecting candidate records for deduplication from a table. A table can be processed to compute an inverse index for each field of the table. A deduplication algorithm can traverse the inverse indices in accordance with a flexible user-defined policy to identify candidate records for deduplication. Both exact matches and approximate matches can be found.
Public/Granted literature
- US20140059015A1 SELECTING CANDIDATE ROWS FOR DEDUPLICATION Public/Granted day:2014-02-27
Information query