Method and system for scalable acceleration of data processing pipeline

Invention Grant

US12050563B2 Method and system for scalable acceleration of data processing pipeline 有权

Please log in to see more content

Patent Title: Method and system for scalable acceleration of data processing pipeline
Application No.: US18049363

Application Date: 2022-10-25
Publication No.: US12050563B2

Publication Date: 2024-07-30
Inventor: Mayank Mishra , Archisman Bhowmick , Rekha Singhal
Applicant: Tata Consultancy Services Limited
Applicant Address: IN Mumbai
Assignee: TATA CONSULTANCY SERVICES LIMITED
Current Assignee: TATA CONSULTANCY SERVICES LIMITED
Current Assignee Address: IN Mumbai
Agency: FINNEGAN, HENDERSON, FARABOW, GARRETT & DUNNER LLP
Priority: IN 2121058260 2021.12.14
Main IPC: G06F17/00
IPC: G06F17/00 ; G06F7/00 ; G06F16/21

Method and system for scalable acceleration of data processing pipeline

Abstract:

The present disclosure provides a scalable acceleration of data processing in Machine Learning pipeline which is unavailable in conventional methods. Initially, the system receives a dataset and a data processing code. A plurality of sample datasets are obtained based on the received dataset using a sampling technique. A plurality of performance parameters corresponding to each of the plurality of sample datasets are obtained based on the data processing code using a profiling technique. A plurality of scalable performance parameters corresponding to each of a plurality of larger datasets are predicted based on the plurality of performance parameters and the data processing code using a curve fitting technique. Simultaneously, a plurality of anti-patterns are located in the data processing code using a pattern matching technique. Finally, an accelerated code is recommended based on the plurality of anti-patterns and the predicted plurality of scalable performance parameters using an accelerated code recommendation technique.

Public/Granted literature

US20230185778A1 METHOD AND SYSTEM FOR SCALABLE ACCELERATION OF DATA PROCESSING PIPELINE. Public/Granted day:2023-06-15

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F17/00	特别适用于特定功能的数字计算设备或数据处理设备或数据处理方法（信息检索，数据库结构或文件系统结构，G06F 16/00）