Invention Grant
- Patent Title: Clustering for parallel processing
- Patent Title (Chinese): 并行处理聚类 (Clustering for parallel processing)
- Application No.: US15148661
- Application Date: 2016-05-06
- Publication No.: US09535742B1
- Publication Date: 2017-01-03
- Inventor: Joseph S. Beda, III, Grzegorz J. Czajkowski, Yonggang Zhao
- Applicant: Google Inc.
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06F9/455
- IPC: G06F9/455

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for clustering for parallel processing. One of the methods includes providing virtual machines with an interface to a shuffle service, the shuffle service executing external to the virtual machines. The method includes receiving data records through the interface, each data record having a key and a value. The method includes partitioning the data records, using the shuffle service, according to their respective keys. The method includes providing a part of the partitioned data records through the interface to the virtual machines, wherein data records having the same key are provided to the same virtual machine. Each of the virtual machines can execute on a host machine, and each of the virtual machines is a hardware virtualization of a machine.
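
The partitioning step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: hash-based partitioning, the `partition_records` function, the `num_workers` parameter, and the sample records are all assumptions introduced here for illustration. The abstract states only that records are partitioned by key and that records with the same key go to the same virtual machine.

```python
import zlib
from collections import defaultdict


def partition_records(records, num_workers):
    """Group (key, value) records so all records sharing a key land in one partition.

    Each partition index stands in for one virtual machine (an assumption of
    this sketch; the abstract does not say how keys are mapped to machines).
    """
    partitions = defaultdict(list)
    for key, value in records:
        # crc32 is deterministic across processes, unlike Python's built-in
        # hash() for strings, so a given key always maps to the same index.
        index = zlib.crc32(key.encode("utf-8")) % num_workers
        partitions[index].append((key, value))
    return dict(partitions)


# Hypothetical sample data: each record has a key and a value.
records = [("apple", 1), ("pear", 2), ("apple", 3), ("fig", 4), ("pear", 5)]
partitions = partition_records(records, num_workers=3)

for index, part in sorted(partitions.items()):
    print(f"worker {index}: {part}")
```

Because the partition index depends only on the key, both `"apple"` records end up at the same worker no matter which virtual machine submitted them, which is the property the abstract requires.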