Invention Grant
- Patent Title: Communication-efficient data parallel ensemble boosting
-
Application No.: US17114644Application Date: 2020-12-08
-
Publication No.: US11948056B2Publication Date: 2024-04-02
- Inventor: Rajesh Bordawekar , Tin Kam Ho
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: CANTOR COLBURN LLP
- Agent Jared Chaney
- Main IPC: G06N20/20
- IPC: G06N20/20 ; G06F9/52 ; G06N5/01

Abstract:
Data-parallel ensemble training using gradient boosted trees includes training an ensemble of trees. The training includes splitting a training dataset into several data portions. Each data portion is assigned to each thread group from a set of thread groups. The training further includes executing a stage, in which each thread group, in parallel, trains a respective ensemble of decision trees. Executing the stage includes performing, by each thread group, in parallel, machine learning operations for the respective ensemble of decision trees using the data portion assigned to each thread group. Further, each thread group validates, in parallel, the respective ensemble of decision trees using a data portion assigned to another thread group. Execution of the stage is repeated until a predetermined threshold is satisfied. Further, a prediction is inferenced using the ensemble of decision trees that is formed using the respective ensemble of trees from each of the thread groups.
Public/Granted literature
- US20220180253A1 COMMUNICATION-EFFICIENT DATA PARALLEL ENSEMBLE BOOSTING Public/Granted day:2022-06-09
Information query