Invention Grant
US07756919B1 Large-scale data processing in a distributed and parallel processing environment (in force)

Large-scale data processing in a distributed and parallel processing environment
Abstract:
A large-scale data processing system and method includes one or more application-independent map modules configured to read input data and to apply at least one application-specific map operation to the input data to produce intermediate data values, wherein the map operation is automatically parallelized across multiple processors in the parallel processing environment. A plurality of intermediate data structures are used to store the intermediate data values. One or more application-independent reduce modules are configured to retrieve the intermediate data values and to apply at least one application-specific reduce operation to the intermediate data values to provide output data.
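The division of labor described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names run_mapreduce, word_count_map, and word_count_reduce are hypothetical, a thread pool stands in for the multiple processors, and in-memory dictionaries stand in for the intermediate data structures. The driver is application-independent; only the two functions passed to it are application-specific.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def run_mapreduce(inputs, map_fn, reduce_fn, num_partitions=2, workers=2):
    """Hypothetical application-independent driver for map and reduce."""
    # Map phase: the driver, not the application code, spreads map_fn
    # across workers (threads here, as a stand-in for processors).
    with ThreadPoolExecutor(max_workers=workers) as pool:
        mapped = list(pool.map(lambda record: list(map_fn(record)), inputs))

    # Intermediate data structures: one key -> values table per partition.
    partitions = [defaultdict(list) for _ in range(num_partitions)]
    for pairs in mapped:
        for key, value in pairs:
            partitions[hash(key) % num_partitions][key].append(value)

    # Reduce phase: every key lives in exactly one partition, so each
    # partition could be handed to a separate reduce worker.
    output = {}
    for partition in partitions:
        for key, values in partition.items():
            output[key] = reduce_fn(key, values)
    return output


# Application-specific operations: a word count, chosen only as an example.
def word_count_map(line):
    for word in line.split():
        yield word.lower(), 1


def word_count_reduce(word, counts):
    return sum(counts)


counts = run_mapreduce(
    ["the quick fox", "the lazy dog", "The end"],
    word_count_map,
    word_count_reduce,
)
```

Swapping in a different pair of map and reduce functions changes the computation without touching the driver, which is the separation the abstract draws between application-independent modules and application-specific operations.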