Invention Grant
US07756919B1 Large-scale data processing in a distributed and parallel processing enviornment
Active
- Patent Title: Large-scale data processing in a distributed and parallel processing enviornment
- Application No.: US10871245
- Application Date: 2004-06-18
- Publication No.: US07756919B1
- Publication Date: 2010-07-13
- Inventor: Jeffrey Dean, Sanjay Ghemawat
- Applicant: Jeffrey Dean, Sanjay Ghemawat
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Morgan, Lewis & Bockius LLP
- Main IPC: G06F15/16
- IPC: G06F15/16

Abstract:
A large-scale data processing system and method includes one or more application-independent map modules configured to read input data and to apply at least one application-specific map operation to the input data to produce intermediate data values, wherein the map operation is automatically parallelized across multiple processors in the parallel processing environment. A plurality of intermediate data structures are used to store the intermediate data values. One or more application-independent reduce modules are configured to retrieve the intermediate data values and to apply at least one application-specific reduce operation to the intermediate data values to provide output data.
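The division of labor described in the abstract can be illustrated with a minimal, single-machine sketch. It is not the patented implementation. The names `run_map_reduce`, `word_count_map`, and `word_count_reduce` are hypothetical, a thread pool stands in for the multiple processors, and in-memory dictionaries stand in for the intermediate data structures. The driver is application-independent; only the two functions passed to it are application-specific.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def run_map_reduce(inputs, map_fn, reduce_fn, num_partitions=2, workers=4):
    """Hypothetical application-independent driver.

    map_fn(item) yields (key, value) pairs; reduce_fn(key, values) returns
    one output value. Both are supplied by the application.
    """
    # Map phase: apply the application-specific map operation to every
    # input item, spread across a pool of workers (threads here, as a
    # stand-in for separate processors).
    with ThreadPoolExecutor(max_workers=workers) as pool:
        mapped = list(pool.map(lambda item: list(map_fn(item)), inputs))

    # Intermediate data structures: one key -> list-of-values table per
    # partition, so that all values for a key land in the same place.
    partitions = [defaultdict(list) for _ in range(num_partitions)]
    for pairs in mapped:
        for key, value in pairs:
            partitions[hash(key) % num_partitions][key].append(value)

    # Reduce phase: each partition is reduced independently by applying
    # the application-specific reduce operation to each key's values.
    def reduce_partition(table):
        return {key: reduce_fn(key, values) for key, values in table.items()}

    output = {}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for result in pool.map(reduce_partition, partitions):
            output.update(result)
    return output


# Application-specific operations for a word-count example (assumed here
# purely for illustration).
def word_count_map(line):
    for word in line.split():
        yield word.lower(), 1


def word_count_reduce(key, values):
    return sum(values)


counts = run_map_reduce(
    ["the quick fox", "the lazy dog", "The end"],
    word_count_map,
    word_count_reduce,
)
```

Swapping in a different pair of map and reduce functions changes what is computed without touching the driver, which is the separation between application-independent modules and application-specific operations that the abstract describes.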