Invention Grant
- Patent Title: FPGA implementation of low latency architecture of XGBoost for inference and method therefor
-
Application No.: US17491689Application Date: 2021-10-01
-
Publication No.: US11748292B2Publication Date: 2023-09-05
- Inventor: Piyush Manavar , Manoj Nambiar
- Applicant: Tata Consultancy Services Limited
- Applicant Address: IN Mumbai
- Assignee: TATA CONSULTANCY SERVICES LIMITED
- Current Assignee: TATA CONSULTANCY SERVICES LIMITED
- Current Assignee Address: IN Mumbai
- Agency: Finnegan, Henderson, Farabow, Garrett & Dunner LLP
- Priority: IN 2121003639 2021.01.27
- Main IPC: G06F13/36
- IPC: G06F13/36 ; G06F13/42 ; G06N20/20

Abstract:
Various embodiments disclosed herein provides method and system for low latency FPGA based system for inference such as recommendation models. Conventional models for inference have high latency and low throughput in decision making models/processes. The disclosed method and system exploits parallelism in processing of XGB models and hence enables minimum possible latency and maximum possible throughput. Additionally, the disclosed system uses a trained model that is (re)trained using only those features which the model had used during training, remaining features are discarded during retraining of the model. The use of such selected set of features thus leads to reduction in the size of digital circuit significantly for the hardware implementation, thereby greatly enhancing the system performance.
Public/Granted literature
- US20220237141A1 FPGA IMPLEMENTATION OF LOW LATENCY ARCHITECTURE OF XGBOOST FOR INFERENCE AND METHOD THEREFOR Public/Granted day:2022-07-28
Information query