Categorical feature enhancement mechanism for gradient boosting decision tree

Invention Grant

US11699106B2 Categorical feature enhancement mechanism for gradient boosting decision tree 有权

Please log in to see more content

Patent Title: Categorical feature enhancement mechanism for gradient boosting decision tree
Application No.: US16355348

Application Date: 2019-03-15
Publication No.: US11699106B2

Publication Date: 2023-07-11
Inventor: Mohammad Zeeshan Siddiqui , Thomas Finley , Sarthak Shah
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: Schwegman Lundberg & Woessner, P.A.
Main IPC: G06N20/20
IPC: G06N20/20 ; G06F18/213 ; G06F18/2451

Categorical feature enhancement mechanism for gradient boosting decision tree

Abstract:

A computer implemented method of generating a gradient boosting decision tree for obtaining predictions includes finding split points by sorting variable values of a feature by their gradient during training of the gradient boosting decision tree, performing a linear search to find a subset of variables with maximum split gain, and modifying a node of the gradient boosting decision tree to have multiple split points on the node for a feature as a function of the linear search. In a further example, a computer implemented method of controlling overfitting in a gradient boosting decision tree includes combining values of low population feature values into a virtual bin, fanning out the virtual bin into feature values having a low population, and including the low population feature values into multiple split points on a node of the gradient boosting decision tree.

Public/Granted literature

US20200293952A1 CATEGORICAL FEATURE ENHANCEMENT MECHANISM FOR GRADIENT BOOSTING DECISION TREE Public/Granted day:2020-09-17

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习
G06N20/20	.•集成学习