Invention Grant
- Patent Title: Method for determining near duplicate data objects
- Patent Title (中): 确定近重复数据对象的方法
-
Application No.: US11572441Application Date: 2005-07-07
-
Publication No.: US08015124B2Publication Date: 2011-09-06
- Inventor: Amir Milo , Yiftach Ravid
- Applicant: Amir Milo , Yiftach Ravid
- Applicant Address: IL
- Assignee: Equivio Ltd
- Current Assignee: Equivio Ltd
- Current Assignee Address: IL
- Agency: Brooks Kushman P.C.
- International Application: PCT/IL2005/000726 WO 20050707
- International Announcement: WO2006/008733 WO 20060126
- Main IPC: G06F15/18
- IPC: G06F15/18

Abstract:
A system for determining that a document B is a candidate for near duplicate to a document A with a given similarity level th. The system includes a storage for providing two different functions on the documents, each function having a numeric function value. The system further includes a processor associated with the storage and configured to determine that the document B is a candidate for near duplicate to the document A, if a condition is met. The condition includes: for any function ƒi from among the two functions, ƒi(A)−ƒi(B)≦δi(ƒ,A,th).
Public/Granted literature
- US20090028441A1 METHOD FOR DETERMINING NEAR DUPLICATE DATA OBJECTS Public/Granted day:2009-01-29
Information query