Streaming joins with synchronization via stream time estimations
Abstract:
Two streams of data items are received. A first estimated processing time for the first stream of data items and a second estimated processing time for the second stream of data items are determined. Data items of the first stream and data items of the second stream are dynamically maintained in a first buffer and a second buffer respectively. The data items of the second stream maintained in the second buffer have associated event times that are within a first join window based on the first estimated processing time for the first stream. A selected data item of the first stream maintained in the first buffer is joined with one or more data items of the second stream maintained in the second buffer that have associated event times that are within a second join window based on an event time associated with the selected data item of the first stream.
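The buffering and join steps described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation, and every name in it (`Item`, `WindowedJoiner`, `window`, `join_ready`) is hypothetical. It rests on three assumptions that the abstract does not state: each stream's time estimate is simply the largest event time seen so far, event times are non-decreasing within each stream, and both join windows share one symmetric half-width. The retention window for the second buffer is anchored at the first stream's estimate, clamped to the oldest first-stream item still waiting so that no needed match is dropped.

```python
from dataclasses import dataclass, field
from typing import Any, List, Tuple


@dataclass
class Item:
    event_time: float
    payload: Any


@dataclass
class WindowedJoiner:
    # Assumption: one symmetric half-width serves both join windows.
    window: float
    # Assumption: each stream-time estimate is the largest event time seen.
    left_time: float = float("-inf")
    right_time: float = float("-inf")
    left_buf: List[Item] = field(default_factory=list)
    right_buf: List[Item] = field(default_factory=list)

    def on_left(self, item: Item) -> None:
        """Buffer an item of the first stream and advance its estimate."""
        self.left_buf.append(item)
        self.left_time = max(self.left_time, item.event_time)

    def on_right(self, item: Item) -> None:
        """Buffer an item of the second stream and advance its estimate."""
        self.right_buf.append(item)
        self.right_time = max(self.right_time, item.event_time)

    def join_ready(self) -> List[Tuple[Item, List[Item]]]:
        """Join each first-stream item whose window the second stream has passed."""
        results: List[Tuple[Item, List[Item]]] = []
        pending: List[Item] = []
        for left in self.left_buf:
            # Strict comparison: an in-order right item equal to the window's
            # upper bound could still arrive while right_time sits on that bound.
            if left.event_time + self.window < self.right_time:
                matches = [
                    r for r in self.right_buf
                    if abs(r.event_time - left.event_time) <= self.window
                ]
                results.append((left, matches))
            else:
                pending.append(left)
        self.left_buf = pending
        self._trim_right()
        return results

    def _trim_right(self) -> None:
        """Keep only second-stream items a current or future join could need."""
        oldest_needed = min(
            [self.left_time] + [p.event_time for p in self.left_buf]
        )
        cutoff = oldest_needed - self.window
        self.right_buf = [r for r in self.right_buf if r.event_time >= cutoff]
```

A caller would feed arrivals through `on_left` and `on_right` and call `join_ready` periodically. Deferring each join until the second stream's estimate has moved past the item's window is one way to synchronize the two streams; a real system would also need a policy for out-of-order or late items, which this sketch leaves out.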