Invention Grant
- Patent Title: Duplicate document detection
- Patent Title (中): 重复文件检测
-
Application No.: US13612840Application Date: 2012-09-13
-
Publication No.: US08768940B2Publication Date: 2014-07-01
- Inventor: Joshua Alspector , Abdur R. Chowdhury , Aleksander Kolcz
- Applicant: Joshua Alspector , Abdur R. Chowdhury , Aleksander Kolcz
- Applicant Address: US CA Menlo Park
- Assignee: Facebook, Inc.
- Current Assignee: Facebook, Inc.
- Current Assignee Address: US CA Menlo Park
- Agency: Keller Jolley Preece
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
In a single-signature duplicate document system, a secondary set of attributes is used in addition to a primary set of attributes so as to improve the precision of the system. When the projection of a document onto the primary set of attributes is below a threshold, then a secondary set of attributes is used to supplement the primary lexicon so that the projection is above the threshold.
Public/Granted literature
- US20130007026A1 Reliability of Duplicate Document Detection Algorithms Public/Granted day:2013-01-03
Information query