Invention Grant
- Patent Title: Similarity-based searching
- Patent Title (中): 基于相似性的搜索
-
Application No.: US12059318Application Date: 2008-03-31
-
Publication No.: US08015190B1Publication Date: 2011-09-06
- Inventor: Roberto J. Bayardo , Yiming Ma , Ramakrishnan Srikant
- Applicant: Roberto J. Bayardo , Yiming Ma , Ramakrishnan Srikant
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Pairs of similar vectors (or objects) in a set of vectors (or objects) are identified. A comparison vector x in a set of vectors is identified; a size threshold is determined such that if a similarity between the vector x and a vector y in the set of vectors is equal to or greater than a similarity threshold, then the vector y has a size at least equal to the size threshold, the size of the candidate vector y being determined based on a number of non-zero features in the vector y. A vector having a size less than the size threshold is removed from the set of candidate vectors.
Information query