Invention Grant
- Patent Title: Joining tables in a mapreduce procedure
- Patent Title (中): 在mapreduce过程中连接表
-
Application No.: US13209567Application Date: 2011-08-15
-
Publication No.: US08924426B2Publication Date: 2014-12-30
- Inventor: Biswapesh Chattopadhyay , Liang Lin
- Applicant: Biswapesh Chattopadhyay , Liang Lin
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06F17/00
- IPC: G06F17/00 ; G06F17/30 ; G06F9/50

Abstract:
Systems and techniques by which tables can be joined in a mapreduce procedure. In some implementations, when a large table of business data (e.g., having one billion transaction records or more) is to be joined with a large table of customer data (e.g., having hundreds of millions of customer records), then these two tables can be organized before the mapreduce procedure to speed up the table join. For example, the business data and the customer data can both be hash partitioned, based on the same key, into shards of business data and shards of customer data, respectively. The number of shards in these two groups has an integer relationship with each other: for example such that there are two business data shards for every customer data shard, or vice versa.
Public/Granted literature
- US20120278323A1 Joining Tables in a Mapreduce Procedure Public/Granted day:2012-11-01
Information query