Scalable distributed computing system for determining exact median and other quantiles in big data applications
Abstract:
A computing system for big data processing includes: a first node, configured to execute a central driver program; and a plurality of data and/or computing nodes, configured to store a plurality of data blocks corresponding to a data set. The first node and the plurality of data and/or computing nodes form a distributed computing environment configured for determining an exact value for one or more desired quantiles for the data set.
Information query
Patent Agency Ranking
0/0