Invention Grant
- Patent Title: Identifying reroutable data columns in an ETL process
- Application No.: US13217714
- Application Date: 2011-08-25
- Publication No.: US09053576B2
- Publication Date: 2015-06-09
- Inventor: Helmut Baumgartner, Christian Gaege, Steffen Kabisch, Stefanie Scherzinger, Sergej Schuetz
- Applicant: Helmut Baumgartner, Christian Gaege, Steffen Kabisch, Stefanie Scherzinger, Sergej Schuetz
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Edell, Shapiro & Finnan, LLC
- Agent: Susan Murray
- Priority: EP10196089 (2010-12-21)
- Main IPC: G06F17/30
- IPC: G06F17/30 ; G06T11/20

Abstract:
Reroutable data columns are identified in an ETL (extract, transform, load) process as follows. An ETL process definition is received that describes a set of processing stages and how each output data column of a processing stage results from a function operating on a set of input data columns. The definition is represented as a directed graph whose nodes represent processing stages and whose links represent data flow between processing stages. At least part of the directed graph is traversed to identify a set of subsequent nodes in which at least one data column is involved only as input data in identity functions; that data column is reroutable between the outmost nodes of the set. In connection with the traversal, information about the reroutable data columns and their respective outmost nodes is maintained.
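
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes a linear chain of stages, assumes identity functions keep the column name unchanged, and reports only runs that span at least two nodes. The names `find_reroutable_columns`, `stages`, `links`, and the example stages and functions (`to_eur`, `upper`, `source`) are all hypothetical.

```python
IDENTITY = "identity"  # assumed label for a pass-through function


def find_reroutable_columns(stages, links, start):
    """Walk a linear path of the stage graph and collect reroutable columns.

    stages: node name -> {output column: (function name, [input columns])}
    links:  node name -> next node name (assumption: at most one successor)
    Returns a list of (column, first_node, last_node) tuples, where the column
    is used only by identity functions in every node from first to last.
    """
    runs = {}    # column -> consecutive nodes that pass it through unchanged
    found = []   # maintained information: column plus its outmost nodes
    node = start
    while node is not None:
        # Collect, per input column, the set of functions that read it here.
        usage = {}
        for func, inputs in stages[node].values():
            for col in inputs:
                usage.setdefault(col, set()).add(func)
        # Close every run whose column is not identity-only at this node.
        for col in list(runs):
            if usage.get(col) != {IDENTITY}:
                nodes = runs.pop(col)
                if len(nodes) >= 2:  # assumption: need two outmost nodes
                    found.append((col, nodes[0], nodes[-1]))
        # Start or extend runs for columns that are identity-only here.
        for col, funcs in usage.items():
            if funcs == {IDENTITY}:
                runs.setdefault(col, []).append(node)
        node = links.get(node)
    # Runs still open at the end of the path are closed at their last node.
    for col, nodes in runs.items():
        if len(nodes) >= 2:
            found.append((col, nodes[0], nodes[-1]))
    return found


# Hypothetical four-stage ETL process definition.
stages = {
    "extract": {"id": ("source", []), "name": ("source", []),
                "amount": ("source", [])},
    "pass_through": {"id": (IDENTITY, ["id"]), "name": (IDENTITY, ["name"]),
                     "amount": (IDENTITY, ["amount"])},
    "convert": {"id": (IDENTITY, ["id"]), "name": (IDENTITY, ["name"]),
                "amount_eur": ("to_eur", ["amount"])},
    "load": {"id": (IDENTITY, ["id"]), "label": ("upper", ["name"]),
             "amount_eur": (IDENTITY, ["amount_eur"])},
}
links = {"extract": "pass_through", "pass_through": "convert",
         "convert": "load"}

result = find_reroutable_columns(stages, links, "extract")
print(result)
```

In this example, `name` is only passed through in `pass_through` and `convert`, and `id` is only passed through from `pass_through` to `load`, so both are reported with those outmost nodes. A general directed graph with branches would need a fuller traversal than this single-successor walk.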
Public/Granted literature
- US20120154405A1 Identifying Reroutable Data Columns in an ETL Process (Public/Granted day: 2012-06-21)