Invention Grant
- Patent Title: Data set discovery engine comprising relativistic retriever
- Application No.: US15074597
- Application Date: 2016-03-18
- Publication No.: US10229186B1
- Publication Date: 2019-03-12
- Inventor: David Stephen Reiner, Nihar Nanda, Leonid Levkovich-Maslyuk, Andrey Abramov
- Applicant: EMC Corporation
- Applicant Address: US MA Hopkinton
- Assignee: EMC IP Holding Company LLC
- Current Assignee: EMC IP Holding Company LLC
- Current Assignee Address: US MA Hopkinton
- Agency: Ryan, Mason & Lewis, LLP
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
An apparatus in one embodiment comprises a processing platform implementing a data set discovery engine. The data set discovery engine comprises a data set indexer configured to generate similarity indexes for a plurality of data sets, and a relativistic retriever coupled to the data set indexer and configured to obtain a suitability template for a query and to execute the query against one or more of the similarity indexes based at least in part on the suitability template. A given one of the similarity indexes comprises at least first and second auxiliary information generated from respective ones of at least first and second different similarity measures of a plurality of different similarity measures. The first and second similarity measures comprise selected ones of the plurality of different similarity measures that are supported by the data set discovery engine with the supported similarity measures comprising both frequency-based and non-frequency-based similarity measures.
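
The arrangement described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that each data set is reduced to a list of tokens, that cosine similarity over term counts stands in for a frequency-based measure, that Jaccard similarity over token sets stands in for a non-frequency-based measure, and that a suitability template is simply a mapping from measure name to weight. All names (`MEASURES`, `build_index`, `retrieve`, the sample data sets) are hypothetical.

```python
from collections import Counter
import math


def tf_aux(tokens):
    """Auxiliary information for the frequency-based measure: term counts."""
    return Counter(tokens)


def set_aux(tokens):
    """Auxiliary information for the non-frequency-based measure: token set."""
    return frozenset(tokens)


def cosine(ca, cb):
    """Cosine similarity between two term-count vectors."""
    dot = sum(ca[t] * cb[t] for t in ca.keys() & cb.keys())
    na = math.sqrt(sum(v * v for v in ca.values()))
    nb = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (na * nb) if na and nb else 0.0


def jaccard(sa, sb):
    """Jaccard similarity between two token sets."""
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


# Hypothetical registry of supported measures: name -> (aux builder, scorer).
MEASURES = {
    "cosine_tf": (tf_aux, cosine),
    "jaccard": (set_aux, jaccard),
}


def build_index(data_sets, measure_names):
    """Indexer: store one piece of auxiliary information per selected measure."""
    return {
        name: {m: MEASURES[m][0](tokens) for m in measure_names}
        for name, tokens in data_sets.items()
    }


def retrieve(index, query_tokens, template):
    """Retriever: rank data sets by the weighted sum of the template's measures."""
    query_aux = {m: MEASURES[m][0](query_tokens) for m in template}
    scores = {
        name: sum(w * MEASURES[m][1](query_aux[m], aux[m])
                  for m, w in template.items())
        for name, aux in index.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Illustrative data only; not taken from the patent.
data_sets = {
    "sales": ["price", "price", "region", "date"],
    "weather": ["temp", "date", "region"],
}
index = build_index(data_sets, ["cosine_tf", "jaccard"])
template = {"cosine_tf": 0.5, "jaccard": 0.5}  # assumed template format
ranked = retrieve(index, ["price", "region"], template)
```

In this sketch the index holds two kinds of auxiliary information per data set, one for each selected measure, and the template decides how much each measure contributes when the query is executed. A different template, for example one that weights only `jaccard`, would rerank the same index without rebuilding it.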