Invention Grant
- Patent Title: System and method for data mining and similarity estimation
- Application No.: US16513803
- Application Date: 2019-07-17
- Publication No.: US10970296B2
- Publication Date: 2021-04-06
- Inventor: Konstantin Kutzkov, Mohamed Ahmed
- Applicant: NEC CORPORATION
- Applicant Address: JP Tokyo
- Assignee: NEC CORPORATION
- Current Assignee: NEC CORPORATION
- Current Assignee Address: JP Tokyo
- Agency: Leydig, Voit & Mayer, Ltd.
- Main IPC: G06F16/00
- IPC: G06F16/00; G06F16/2458; G06F16/2455; G06F16/2457

Abstract:
A method for data mining includes receiving input vectors and converting them into corresponding sketch feature vectors each having a number of output dimensions that is less than a number of dimensions of the corresponding input vector. Each sketch feature vector is compared against parameters and a decision loop generates results of similarities based on the comparisons. An estimate of cosine similarity or Pearson correlation of the input vectors is obtained based on estimates of an inner product of two input vectors and a 2-norm vector of an input vector. The estimates are obtained using respective hash tables for each input vector having a number of entries up to the number of output dimensions of the sketch feature vector. A decision is provided based on the results of the similarities and an application of the data mining such that the decision is implemented by the application.
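The estimation step described in the abstract can be illustrated with a count-sketch-style construction. The Python below is a minimal sketch under stated assumptions, not the patented procedure: the function names `sketch`, `inner_estimate`, and `cosine_estimate`, the BLAKE2b-based hashing, and the sparse `{index: value}` input format are all hypothetical choices made for illustration. Each input vector is folded into a hash table with k entries, and the inner product and 2-norm are then estimated from those tables alone.

```python
import hashlib
import math
from typing import Dict, List


def _hash64(seed: int, index: int) -> int:
    """Deterministic 64-bit hash of a coordinate index (illustrative choice)."""
    digest = hashlib.blake2b(f"{seed}:{index}".encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def sketch(vec: Dict[int, float], k: int, seed: int = 0) -> List[float]:
    """Fold a sparse vector into a table of k entries (count-sketch style)."""
    table = [0.0] * k
    for index, value in vec.items():
        h = _hash64(seed, index)
        bucket = (h >> 1) % k              # which of the k entries to update
        sign = 1.0 if h & 1 else -1.0      # pseudo-random sign from the low bit
        table[bucket] += sign * value
    return table


def inner_estimate(a: List[float], b: List[float]) -> float:
    """Estimate the inner product of two inputs from their sketches."""
    return sum(x * y for x, y in zip(a, b))


def cosine_estimate(a: List[float], b: List[float]) -> float:
    """Estimate cosine similarity from inner-product and 2-norm estimates."""
    norm_a = math.sqrt(inner_estimate(a, a))
    norm_b = math.sqrt(inner_estimate(b, b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return inner_estimate(a, b) / (norm_a * norm_b)


# Usage: both vectors must be sketched with the same k and the same seed.
u = {1: 3.0, 5: 4.0, 100: 2.0}
v = {1: 1.0, 5: 2.0, 7: 5.0}
print(cosine_estimate(sketch(u, 64), sketch(v, 64)))
```

Because the Pearson correlation of two vectors equals the cosine similarity of their mean-centered versions, the same estimator could in principle be applied after centering; how the patent handles that step is not stated in the abstract.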
Public/Granted literature
- US20190340176A1 SYSTEM AND METHOD FOR DATA MINING AND SIMILARITY ESTIMATION Public/Granted day: 2019-11-07