Invention Grant
- Patent Title: Scalable streaming decision tree learning
-
Application No.: US14833397Application Date: 2015-08-24
-
Publication No.: US10671924B2Publication Date: 2020-06-02
- Inventor: Wei Shan Dong , Peng Gao , Guo Qiang Hu , Chang Sheng Li , Xu Liang Li , Chun Yang Ma , Zhi Wang , Xin Zhang
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Cantor Colburn LLP
- Agent Joseph Petrokaitis
- Main IPC: G06N5/02
- IPC: G06N5/02

Abstract:
In one embodiment, a computer-implemented method includes receiving training data including a plurality of records, each record having a plurality of attributes. The training data is horizontally parallelized across two or more processing elements. This horizontal parallelizing includes dividing the training data into two or more subsets of records; assigning each subset of records to a corresponding processing element of the two or more processing elements; transmitting each subset of records to its assigned processing element; and sorting, at the two or more processing elements, the two or more subsets of records to two or more candidate leaves of a decision tree. The output from horizontally parallelizing is converted into input for vertically parallelizing the training data. The training data is vertically parallelized across the two or more processing elements. The decision tree is grown based at least in part on the horizontally parallelizing, the converting, and the vertically parallelizing.
Public/Granted literature
- US20170061318A1 SCALABLE STREAMING DECISION TREE LEARNING Public/Granted day:2017-03-02
Information query