Elastic management of machine learning computing

Invention Grant

US10649806B2 Elastic management of machine learning computing (in force)

Patent Title: Elastic management of machine learning computing
Application No.: US15951088

Application Date: 2018-04-11
Publication No.: US10649806B2

Publication Date: 2020-05-12
Inventor: Aurick Qiao, Qirong Ho, Eric Xing
Applicant: Petuum Inc.
Applicant Address: US PA Pittsburgh
Assignee: PETUUM, INC.
Current Assignee: PETUUM, INC.
Current Assignee Address: US PA Pittsburgh
Main IPC: G06F9/48
IPC: G06F9/48; G06F9/50; G06F9/28; G06N20/00

Abstract:

A computer system implements a method for elastic resource management for executing a machine learning (ML) program. The system is configured to: create a set of logical executors and assign them across a set of networked physical computation units of a distributed computing system; partition and distribute input data and Work Tasks across the set of logical executors, where the Work Tasks are partitioned into short units of computation (micro-tasks), each of which calculates a partial update to the ML program's model parameters and lasts less than one second; create a set of logical servers (LSes); partition and distribute globally shared model parameters of the ML program across the set of logical servers; and execute the partitioned Work Tasks according to a bounded asynchronous parallel standard, where a current Work Task is allowed to execute with stale model parameters, without having all the current calculation updates from the Work Tasks it depends on, provided the staleness of the model parameters is within a predefined limit.
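The mechanism in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (LogicalServer, assign_round_robin, can_run, the logical-clock arguments) is hypothetical, the round-robin placement is an assumption, and the staleness rule is one common way to express a bounded asynchronous gate rather than the rule the patent claims.

```python
from dataclasses import dataclass, field


@dataclass
class LogicalServer:
    """Holds one partition of the globally shared model parameters (hypothetical)."""
    params: dict = field(default_factory=dict)

    def apply_update(self, partial_update: dict) -> None:
        # Each micro-task contributes an additive partial update.
        for key, delta in partial_update.items():
            self.params[key] = self.params.get(key, 0.0) + delta


def assign_round_robin(num_logical: int, physical_units: list) -> dict:
    """Map logical executor/server ids onto physical units.

    Round-robin placement is an assumption made for illustration only.
    """
    return {i: physical_units[i % len(physical_units)] for i in range(num_logical)}


def can_run(task_clock: int, slowest_clock: int, staleness_limit: int) -> bool:
    """Assumed bounded-asynchronous gate.

    A micro-task at logical clock `task_clock` may run with stale parameters
    as long as it is at most `staleness_limit` clocks ahead of the slowest
    task whose updates it depends on.
    """
    return task_clock - slowest_clock <= staleness_limit


if __name__ == "__main__":
    placement = assign_round_robin(4, ["node-a", "node-b"])
    print(placement)

    server = LogicalServer()
    server.apply_update({"w0": 0.5})
    server.apply_update({"w0": -0.25, "w1": 1.0})
    print(server.params)

    print(can_run(task_clock=3, slowest_clock=1, staleness_limit=2))
    print(can_run(task_clock=4, slowest_clock=1, staleness_limit=2))
```

Because logical executors and servers are decoupled from physical units in this sketch, changing the list passed to assign_round_robin is all it takes to remap them when machines join or leave, which is the sense of "elastic" used here.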

Published/Granted literature

US20180300171A1 Elastic Management of Machine Learning Computing (publication date: 2018-10-18)

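IPC classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F9/00	Arrangements for program control, e.g. control units (program control for peripheral devices: see G06F13/10)
G06F9/06	.using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
G06F9/46	..Multiprogramming arrangements
G06F9/48	...Program initiating; program switching, e.g. by interrupt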