Invention Grant
- Patent Title: Systems and methods for quantile determination in a distributed data system using sampling
- Application No.: US15212010
- Application Date: 2016-07-15
- Publication No.: US09703852B2
- Publication Date: 2017-07-11
- Inventor: Guy Blanc, Georges H. Guirguis, Xiangqian Hu, Guixian Lin, Scott Pope
- Applicant: SAS Institute Inc.
- Applicant Address: US NC Cary
- Assignee: SAS INSTITUTE INC.
- Current Assignee: SAS INSTITUTE INC.
- Current Assignee Address: US NC Cary
- Agency: Kilpatrick Townsend & Stockton LLP
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
In accordance with the teachings described herein, systems and methods are provided for estimating or determining quantiles for data stored in a distributed system. In one embodiment, an instruction is received to estimate or determine a specified quantile for a variate in a set of data stored at a plurality of nodes in the distributed system. A plurality of data bins for the variate are defined that are each associated with a different range of data values in the set of data. Lower and upper quantile bounds for each of the plurality of data bins are determined based on the total number of data values that fall within each of the plurality of data bins. The specified quantile is estimated or determined based on an identified one of the plurality of data bins that includes the specified quantile based on the lower and upper quantile bounds.
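
The binning steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names `bin_counts` and `locate_quantile_bin`, the fixed bin edges, and the in-memory lists standing in for nodes are all assumptions made for illustration. Each node counts its values per bin, the counts are summed, and the cumulative fractions give the lower and upper quantile bounds of each bin.

```python
from bisect import bisect_right
from typing import List, Sequence, Tuple


def bin_counts(values: Sequence[float], edges: Sequence[float]) -> List[int]:
    """Count one node's values per bin [edges[i], edges[i+1]).

    Assumes edges are sorted and span the data; the last bin is closed
    on the right so the maximum value is counted.
    """
    counts = [0] * (len(edges) - 1)
    for v in values:
        i = bisect_right(edges, v) - 1
        i = max(0, min(i, len(counts) - 1))  # clamp values sitting on the outer edges
        counts[i] += 1
    return counts


def locate_quantile_bin(
    nodes: Sequence[Sequence[float]], edges: Sequence[float], q: float
) -> Tuple[int, float, float]:
    """Return (bin index, lower bound, upper bound) for quantile q, 0 < q <= 1.

    The bounds are the cumulative fractions of all data values below the
    bin and up to the end of the bin, computed from the summed counts.
    """
    if not 0 < q <= 1:
        raise ValueError("q must be in (0, 1]")
    totals = [0] * (len(edges) - 1)
    for node_values in nodes:  # each node only reports its per-bin counts
        for i, c in enumerate(bin_counts(node_values, edges)):
            totals[i] += c
    n = sum(totals)
    if n == 0:
        raise ValueError("no data values")
    cumulative = 0
    for i, c in enumerate(totals):
        lower = cumulative / n
        cumulative += c
        upper = cumulative / n
        if c > 0 and lower < q <= upper:
            return i, lower, upper
    raise ValueError("quantile not located")


# Hypothetical data spread over three nodes, with four equal-width bins.
nodes = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10]]
edges = [0, 2.5, 5, 7.5, 10]
print(locate_quantile_bin(nodes, edges, 0.5))
```

In this sketch only per-bin counts leave each node, so the identified bin narrows the search for the quantile without moving the raw data. How the value inside that bin is then estimated or determined is not specified in the abstract and is left out here.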
Public/Granted literature
- US20160350396A1 SYSTEMS AND METHODS FOR QUANTILE DETERMINATION IN A DISTRIBUTED DATA SYSTEM USING SAMPLING (Public/Granted day: 2016-12-01)