Invention Grant
- Patent Title: Estimating number of distinct values in a data set using machine learning
-
Application No.: US16877882Application Date: 2020-05-19
-
Publication No.: US11620547B2Publication Date: 2023-04-04
- Inventor: Tomas Karnagel , Onur Kocberber , Farhan Tauheed , Nipun Agarwal
- Applicant: Oracle International Corporation
- Applicant Address: US CA Redwood Shores
- Assignee: Oracle International Corporation
- Current Assignee: Oracle International Corporation
- Current Assignee Address: US CA Redwood Shores
- Agency: Hickman Becker Bingham Ledesma LLP
- Main IPC: G06N5/04
- IPC: G06N5/04 ; G06N20/00

Abstract:
Techniques for estimating the number of distinct values in a data set using machine learning are provided. In one technique, a sample of a data set is retrieved where the sample is a strict subset of the data set. The sample is analyzed to identify feature values of multiple features of the sample. The feature values are inserted into a machine-learned model that computes a prediction regarding a number of distinct values in the data set. An estimated number of distinct values that is based on the prediction is stored in association with the data set.
Public/Granted literature
- US20210365805A1 ESTIMATING NUMBER OF DISTINCT VALUES IN A DATA SET USING MACHINE LEARNING Public/Granted day:2021-11-25
Information query