Invention Grant
US08280915B2 Binning predictors using per-predictor trees and MDL pruning (In force)

Binning predictors using per-predictor trees and MDL pruning
Abstract:
Binning of predictor values used for generating a data mining model provides a useful reduction in memory footprint and computation during the computationally dominant decision tree build phase, while limiting the information loss of the model and the introduction of false information artifacts. A method of binning data in a database for data mining modeling in a database system, the data being stored in a database table in the database system, the data mining modeling having selected at least one predictor and one target for the data, and the data including a plurality of values of the predictor and a plurality of values of the target, comprises constructing a binary tree for the predictor that splits the values of the predictor into a plurality of portions, pruning the binary tree, and defining as bins of the predictor the leaves of the tree that remain after pruning, each leaf of the tree representing a portion of the values of the predictor.
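The three steps named in the abstract (build a binary tree for one predictor, prune it, read the bins off the surviving leaves) can be illustrated with a minimal Python sketch. The abstract does not state the split criterion or the exact description-length formula, so the entropy-based split and the simplified MDL-style cost used here (bits to encode the targets plus log2(n) bits per split) are assumptions, as are all function names and the min_leaf parameter. The sketch also assumes a non-empty list of values.

```python
import math
from collections import Counter

def entropy_bits(targets):
    """Bits needed to encode the target labels at their empirical frequencies."""
    n = len(targets)
    return -sum(c * math.log2(c / n) for c in Counter(targets).values())

def build_tree(pairs, min_leaf=2):
    """Build a binary tree over (value, target) pairs sorted by value."""
    targets = [t for _, t in pairs]
    node = {"targets": targets}
    best = None
    for i in range(min_leaf, len(pairs) - min_leaf + 1):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot place a threshold between equal values
        cost = entropy_bits(targets[:i]) + entropy_bits(targets[i:])
        if best is None or cost < best[0]:
            best = (cost, i)
    # Split only when it strictly reduces the cost of encoding the targets.
    if best is not None and best[0] < entropy_bits(targets):
        i = best[1]
        node["threshold"] = (pairs[i - 1][0] + pairs[i][0]) / 2
        node["left"] = build_tree(pairs[:i], min_leaf)
        node["right"] = build_tree(pairs[i:], min_leaf)
    return node

def prune(node):
    """Bottom-up pruning; returns the description length (bits) of the subtree.

    Assumed cost model: a split costs log2(n) bits to name its threshold.
    """
    leaf_cost = entropy_bits(node["targets"])
    if "threshold" not in node:
        return leaf_cost
    n = len(node["targets"])
    split_cost = math.log2(n) + prune(node["left"]) + prune(node["right"])
    if split_cost >= leaf_cost:
        # The split does not pay for itself: collapse it into a leaf.
        for key in ("threshold", "left", "right"):
            del node[key]
        return leaf_cost
    return split_cost

def cut_points(node):
    """In-order list of surviving thresholds; the bins lie between them."""
    if "threshold" not in node:
        return []
    return (cut_points(node["left"]) + [node["threshold"]]
            + cut_points(node["right"]))

def bin_predictor(values, targets, min_leaf=2):
    """Return the bin boundaries for one predictor after pruning."""
    pairs = sorted(zip(values, targets), key=lambda p: p[0])
    tree = build_tree(pairs, min_leaf)
    prune(tree)
    return cut_points(tree)
```

With this cost model, a predictor whose values separate the target cleanly keeps its split, while a predictor whose splits only fit noise collapses to a single bin. Each leaf that remains after pruning corresponds to one interval between consecutive cut points.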