Invention Grant
- Patent Title: Reliability of duplicate document detection algorithms
- Patent Title (中): 重复文件检测算法的可靠性
-
Application No.: US13185238Application Date: 2011-07-18
-
Publication No.: US08429178B2Publication Date: 2013-04-23
- Inventor: Joshua Alspector , Aleksander Kolcz , Abdur R. Chowdhury
- Applicant: Joshua Alspector , Aleksander Kolcz , Abdur R. Chowdhury
- Applicant Address: US CA Menlo Park
- Assignee: Facebook, Inc.
- Current Assignee: Facebook, Inc.
- Current Assignee Address: US CA Menlo Park
- Agency: Keller Jolley & Preece
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
In a single-signature duplicate document system, a secondary set of attributes is used in addition to a primary set of attributes so as to improve the precision of the system. When the projection of a document onto the primary set of attributes is below a threshold, then a secondary set of attributes is used to supplement the primary lexicon so that the projection is above the threshold.
Public/Granted literature
- US20110276646A1 RELIABILITY OF DUPLICATE DOCUMENT DETECTION ALGORITHMS Public/Granted day:2011-11-10
Information query