-
公开(公告)号:CA2415018C
公开(公告)日:2006-09-19
申请号:CA2415018
申请日:2002-12-23
Applicant: IBM CANADA
Inventor: LEITCH MARK D , LIGHTSTONE SAM S , LAU LEO TAT MAN , BERKS ROBERT T , FLASZA MIROSLAW A , TREMAINE DAVID
IPC: G06F16/22 , G06F3/06 , G06F12/06 , G06F12/0882
Abstract: Loading input data into a multi-dimensional clustering (MDC) table or other structure containing data clustered along one or more dimensions entails assembling blocks of data in a partial block cache in which each partial block is associated with a distinc t logical cell. A minimum threshold number of partial blocks may be maintained. Partial blocks may be spilled from the partial block cache to make room for new logical cells. Last partia l pages of spilled partial blocks may be stored in a partial page cache to limit I/O if the cel l associated with a spilled block is encountered later in the input data stream. Buffers may be reassign ed from the partial block cache to the partial page cache if the latter is filled. Parallelism m ay be employed for efficiency during sorting of input data subsets and during storage of blocks to secondary storage.
-
公开(公告)号:CA2702133A1
公开(公告)日:2010-07-24
申请号:CA2702133
申请日:2010-05-21
Applicant: IBM CANADA
Inventor: LEW IVAN , SANTARS ALEKSANDRS , SARDAR ADIL M , TREMAINE DAVID
-
公开(公告)号:CA2415018A1
公开(公告)日:2004-06-23
申请号:CA2415018
申请日:2002-12-23
Applicant: IBM CANADA
Inventor: LAU LEO TAT MAN , LEITCH MARK D , FLASZA MIROSLAW A , TREMAINE DAVID , LIGHTSTONE SAM S , BERKS ROBERT T
IPC: G06F16/22 , G06F3/06 , G06F12/06 , G06F12/0882 , G06F17/30
Abstract: Loading input data into a multi-dimensional clustering (MDC) table or other structure containing data clustered along one or more dimensions entails assembling blocks of data in a partial block cache in which each partial block is associated with a distinc t logical cell. A minimum threshold number of partial blocks may be maintained. Partial blocks may be spilled from the partial block cache to make room for new logical cells. Last partia l pages of spilled partial blocks may be stored in a partial page cache to limit I/O if the cel l associated with a spilled block is encountered later in the input data stream. Buffers may be reassign ed from the partial block cache to the partial page cache if the latter is filled. Parallelism m ay be employed for efficiency during sorting of input data subsets and during storage of blocks to secondary storage.
-
公开(公告)号:CA2383713A1
公开(公告)日:2003-10-26
申请号:CA2383713
申请日:2002-04-26
Applicant: IBM CANADA
Inventor: HURAS MATTHEW A , TREMAINE DAVID , BERKS ROBERT T , FLASZA MIROSLAW A , VALENTIN GARY , LAU LEO T M , MCARTHUR CATHERINE S , WINER MICHAEL J
IPC: G06F16/901 , G06F17/30 , G06F17/60
Abstract: The invention herein provides method and apparatus for managing attribute- tagged index entries contained in an index by assigning or tagging an index entry attribu te indicator to a newly inserted index entry (that is, at the time when a key data value is inserted into the index, the key data value becomes a newly inserted index entry and an attribute indicator i s tagged to the newly inserted index entry). The index entry can be placed anywhere within the index. Once a group of attribute-tagged index entries have been inserted into the index, the attribute identifiers can be switched off instantaneously (that is, the attribute associated with the attribute indicators is disabled). For example, an attribute identifier can be used to defer accessibility of appropriately attribute-tagged index entries until a predetermined time later when the attribute can be disabled and thus allow users to access the attribute-tagged index entries without having to remove the attribute indicators from the index).
-
-
-