Fine-grain synchronization in data-parallel jobs for distributed machine learning

Invention Grant

US10402235B2 Fine-grain synchronization in data-parallel jobs for distributed machine learning 有权

Please log in to see more content

Patent Title: Fine-grain synchronization in data-parallel jobs for distributed machine learning
Application No.: US15980196

Application Date: 2018-05-15
Publication No.: US10402235B2

Publication Date: 2019-09-03
Inventor: Asim Kadav , Erik Kruus
Applicant: NEC Laboratories America, Inc.
Applicant Address: JP
Assignee: NEC CORPORATION
Current Assignee: NEC CORPORATION
Current Assignee Address: JP
Agent Joseph Kolodka
Main IPC: G06F9/46
IPC: G06F9/46 ; G06F9/52 ; G06N20/00 ; H04L29/08

Fine-grain synchronization in data-parallel jobs for distributed machine learning

Abstract:

A computer-implemented method and computer processing system are provided. The method includes synchronizing, by a processor, respective ones of a plurality of data parallel workers with respect to an iterative distributed machine learning process. The synchronizing step includes individually continuing, by the respective ones of the plurality of data parallel workers, from a current iteration to a subsequent iteration of the iterative distributed machine learning process, responsive to a satisfaction of a predetermined condition thereby. The predetermined condition includes individually sending a per-receiver notification from each sending one of the plurality of data parallel workers to each receiving one of the plurality of data parallel workers, responsive to a sending of data there between. The predetermined condition further includes individually sending a per-receiver acknowledgement from the receiving one to the sending one, responsive to a consumption of the data thereby.

Public/Granted literature

US20180260256A1 FINE-GRAIN SYNCHRONIZATION IN DATA-PARALLEL JOBS FOR DISTRIBUTED MACHINE LEARNING Public/Granted day:2018-09-13

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F9/00	程序控制装置，例如，控制单元（用于外部设备的程序控制入G06F13/10）
G06F9/06	.应用存入的程序的，即应用处理设备的内部存储来接收程序并保持程序的
G06F9/46	..多道程序装置