Invention Grant
- Patent Title: Optimization model for processing hierarchical data in stream systems
- Patent Title (中): 在流系统中处理分层数据的优化模型
- Application No.: US11850588
- Application Date: 2007-09-05
- Publication No.: US07860863B2
- Publication Date: 2010-12-28
- Inventor: Amir Bar-Or, Michael James Beckerle
- Applicant: Amir Bar-Or, Michael James Beckerle
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Konrad Raynes & Victor LLP
- Agent: Janaki K. Davda
- Main IPC: G06F7/00
- IPC: G06F7/00

Abstract:
Provided are techniques for optimizing the processing of hierarchical data. A linear processing graph is received, wherein the linear processing graph includes a plurality of operators, wherein each operator in the plurality is connected to at least one other operator by an arc, wherein hierarchical data flows on arcs, wherein the operators read and replace identified subregions within the hierarchical data flowing into the operators on the arcs, and wherein the operators do not modify the hierarchical data outside of these identified subregions. For each operator in the linear processing graph, a minimal set of dependent upstream operators on which that operator depends is found by examining how the identified subregions are created in the linear processing graph through obtaining a set of operators on which that operator depends, by analyzing dependencies carried by a set of vector nodes of the hierarchical data in an input schema of the operator, and, for each of the vector nodes, by analyzing an associated set of scalar nodes, wherein finding the minimum set of operators includes taking into consideration data preservation characteristics of the plurality of operators and taking into consideration structural-order preservation characteristics of the plurality of operators. The linear processing graph is rewritten to create a new graph that expresses dependencies based on the minimal set of dependent upstream operators for each operator.
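The dependency analysis described in the abstract can be illustrated with a small sketch. This is not the patented method: it assumes subregions are identified by slash-separated path strings, that each operator declares the paths it reads and the paths it replaces, and that two subregions overlap when one path is an ancestor of the other. The names `Operator`, `minimal_dependencies`, and `rewrite_arcs` are hypothetical, and the sketch ignores the vector/scalar node distinction and the structural-order preservation characteristics that the abstract says are also taken into account.

```python
from dataclasses import dataclass, field


@dataclass
class Operator:
    # Hypothetical model of an operator: a name plus the subregion paths
    # it reads and the subregion paths it replaces.
    name: str
    reads: set = field(default_factory=set)
    writes: set = field(default_factory=set)


def _covers(ancestor, path):
    """True if `ancestor` equals `path` or is an ancestor of it."""
    ancestor = ancestor.rstrip("/")
    path = path.rstrip("/")
    return path == ancestor or path.startswith(ancestor + "/")


def minimal_dependencies(chain):
    """For each operator in a linear chain, collect the upstream operators
    whose replaced subregions overlap a subregion the operator reads.

    Assumed simplification: walking upstream for one read path stops at the
    first operator that replaces that path or one of its ancestors, because
    everything earlier is hidden behind that replacement.
    """
    deps = {}
    for i, op in enumerate(chain):
        needed = set()
        for path in op.reads:
            for upstream in reversed(chain[:i]):
                full = any(_covers(w, path) for w in upstream.writes)
                partial = any(_covers(path, w) for w in upstream.writes)
                if full or partial:
                    needed.add(upstream.name)
                if full:
                    break
        deps[op.name] = needed
    return deps


def rewrite_arcs(chain):
    """Return the arcs of a new graph that keeps only the dependencies found."""
    deps = minimal_dependencies(chain)
    return [(u, op.name) for op in chain for u in sorted(deps[op.name])]


# Example linear graph A -> B -> C -> D with hypothetical subregion paths.
chain = [
    Operator("A", writes={"/order/customer"}),
    Operator("B", writes={"/order/items"}),
    Operator("C", reads={"/order/customer/name"}, writes={"/order/customer/name"}),
    Operator("D", reads={"/order/items"}, writes={"/order/total"}),
]

print(minimal_dependencies(chain))
print(rewrite_arcs(chain))
```

In this example, C depends only on A and D depends only on B, so the linear chain is rewritten into two independent arcs, A to C and B to D, which a stream system could in principle schedule separately.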
Public/Granted literature
- US20090063515A1 OPTIMIZATION MODEL FOR PROCESSING HIERARCHICAL DATA IN STREAM SYSTEMS, Public/Granted day: 2009-03-05