Invention Grant
US08280915B2 Binning predictors using per-predictor trees and MDL pruning
Active
- Patent Title: Binning predictors using per-predictor trees and MDL pruning
- Application No.: US11344185
- Application Date: 2006-02-01
- Publication No.: US08280915B2
- Publication Date: 2012-10-02
- Inventor: Mahesh Jagannath, Chitra Bhagwat, Joseph Yarmus, Ari W. Mozes
- Applicant: Mahesh Jagannath, Chitra Bhagwat, Joseph Yarmus, Ari W. Mozes
- Applicant Address: US CA Redwood Shores
- Assignee: Oracle International Corporation
- Current Assignee: Oracle International Corporation
- Current Assignee Address: US CA Redwood Shores
- Agency: Murphy & King, P.C.
- Main IPC: G06F7/00
- IPC: G06F7/00; G06F17/30

Abstract:
Binning the predictor values used to generate a data mining model usefully reduces memory footprint and computation during the computationally dominant decision tree build phase, while limiting the model's information loss and the introduction of false information artifacts. The method bins data for data mining modeling in a database system, where the data is stored in a database table, at least one predictor and one target have been selected for the data, and the data includes a plurality of values of the predictor and a plurality of values of the target. The method comprises constructing a binary tree for the predictor that splits the values of the predictor into a plurality of portions, pruning the binary tree, and defining the leaves that remain after pruning as the bins of the predictor, each leaf representing a portion of the values of the predictor.
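
The three steps named in the abstract (build a binary tree for one predictor, prune it, read the surviving leaves as bins) can be illustrated with a minimal Python sketch. It is not the patented procedure: the abstract does not give the split criterion or the MDL formula, so the entropy-based split, the `log2(n)` split penalty, the `max_depth` limit, and all function names below are assumptions chosen only for illustration. The data is held in plain lists rather than a database table.

```python
import math
from collections import Counter


def entropy_bits(targets):
    """Shannon entropy, in bits per row, of a list of target labels."""
    n = len(targets)
    return -sum((c / n) * math.log2(c / n) for c in Counter(targets).values())


def leaf_cost(rows):
    """Assumed description length of a leaf: bits to encode its targets."""
    return len(rows) * entropy_bits([t for _, t in rows])


def build_tree(rows, depth=0, max_depth=4):
    """Grow a binary tree over one predictor.

    rows: (predictor_value, target) pairs sorted by predictor_value.
    Returns a dict; internal nodes carry "cut", "left" and "right".
    """
    node = {"rows": rows}
    if depth >= max_depth or len({t for _, t in rows}) == 1:
        return node
    best = None
    for i in range(1, len(rows)):
        if rows[i - 1][0] == rows[i][0]:
            continue  # a cut is only possible between distinct values
        cost = leaf_cost(rows[:i]) + leaf_cost(rows[i:])
        if best is None or cost < best[0]:
            best = (cost, i)
    if best is None:
        return node
    i = best[1]
    node["cut"] = (rows[i - 1][0] + rows[i][0]) / 2
    node["left"] = build_tree(rows[:i], depth + 1, max_depth)
    node["right"] = build_tree(rows[i:], depth + 1, max_depth)
    return node


def prune(node):
    """Bottom-up pruning; returns the description length of the subtree."""
    as_leaf = leaf_cost(node["rows"])
    if "cut" not in node:
        return as_leaf
    # Assumption: encoding one cut point costs log2(rows in the node) bits.
    penalty = math.log2(len(node["rows"]))
    as_split = prune(node["left"]) + prune(node["right"]) + penalty
    if as_split >= as_leaf:  # the split does not pay for itself
        del node["cut"], node["left"], node["right"]
        return as_leaf
    return as_split


def leaves_as_bins(node):
    """Return (low, high) value ranges for the leaves, left to right."""
    if "cut" not in node:
        values = [v for v, _ in node["rows"]]
        return [(min(values), max(values))]
    return leaves_as_bins(node["left"]) + leaves_as_bins(node["right"])


def bin_predictor(values, targets, max_depth=4):
    """Build, prune, and return the bins for a single predictor."""
    rows = sorted(zip(values, targets), key=lambda r: r[0])
    tree = build_tree(rows, max_depth=max_depth)
    prune(tree)
    return leaves_as_bins(tree)


if __name__ == "__main__":
    print(bin_predictor([1, 2, 3, 4, 5, 6, 7, 8], list("aaaabbbb")))
    print(bin_predictor([1, 2, 3, 4], list("abab")))
```

Under these assumptions, a predictor whose values separate the target cleanly keeps its split and yields two bins, while a predictor whose target alternates from row to row is pruned back to a single bin, because the cost of encoding the cuts exceeds the bits saved.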
Public/Granted literature
- US20070185896A1 Binning predictors using per-predictor trees and MDL pruning (Public/Granted day: 2007-08-09)