-
公开(公告)号:CA2436412A1
公开(公告)日:2002-09-06
申请号:CA2436412
申请日:2002-02-25
Applicant: IBM
Inventor: GARA ALAN G , COTEUS PAUL W , JACKSON RORY D , KOPCSAY GERARD V , VRANAS PAVLOS M , TAKKEN TODD E , NATHANSON BEN J , BARRETT WAYNE M , CHEN DONG
IPC: G06F11/10 , G06F9/46 , G06F9/52 , G06F11/00 , G06F11/20 , G06F12/00 , G06F12/02 , G06F12/08 , G06F12/10 , G06F13/00 , G06F13/24 , G06F13/38 , G06F15/173 , G06F15/177 , G06F15/80 , G06F17/14 , H04L1/00 , H04L7/02 , H04L7/033 , H04L12/28 , H04L12/56 , H04L25/02 , H05K7/20 , H04L7/00 , H04B10/08 , H03D3/24
Abstract: A data capture technique for high speed signaling to allow for optimal sampling of an asynchronous data stream. This technique allows for extremely high data rates and does not require that a clock be sent with the data as i s done in source synchronous systems. The present invention also provides a hardware mechanism for automatically adjusting transmission delays (Fig. 5) for optimal two-bit simultaneous bi-directional (SiBiDi) signaling.
-
公开(公告)号:DE60236510D1
公开(公告)日:2010-07-08
申请号:DE60236510
申请日:2002-02-25
Applicant: IBM
Inventor: CHEN DONG , COTEUS PAUL W , GARA ALAN G , GIAMPAPA MARK E , TAKKEN TODD E
-
公开(公告)号:DE60233055D1
公开(公告)日:2009-09-03
申请号:DE60233055
申请日:2002-02-25
Applicant: IBM
Inventor: BLUMRICH MATTHIAS A , CHEN DONG , COTEUS PAUL W , GARA ALAN G , GIAMPAPA MARK E , HEIDELBERGER PHILIP , HOENICKE DIRK , OHNMACHT MARTIN
IPC: G06F11/10 , G06F12/00 , G06F9/46 , G06F9/52 , G06F11/00 , G06F11/20 , G06F12/02 , G06F12/08 , G06F12/10 , G06F13/00 , G06F13/24 , G06F13/38 , G06F15/173 , G06F15/177 , G06F15/80 , G06F17/14 , H04L1/00 , H04L7/02 , H04L7/033 , H04L12/28 , H04L12/56 , H04L25/02 , H05K7/20
Abstract: A method and apparatus for managing coherence between two processors of a two processor node of a multi-processor computer system. Generally the present invention relates to a software algorithm that simplifies and significantly speeds the management of cache coherence in a message passing parallel computer, and to hardware apparatus that assists this cache coherence algorithm. The software algorithm uses the opening and closing of put/get windows to coordinate the activated required to achieve cache coherence. The hardware apparatus may be an extension to the hardware address decode, that creates, in the physical memory address space of the node, an area of virtual memory that (a) does not actually exist, and (b) is therefore able to respond instantly to read and write requests from the processing elements.
-
公开(公告)号:AT437402T
公开(公告)日:2009-08-15
申请号:AT02713681
申请日:2002-02-25
Applicant: IBM
Inventor: BLUMRICH MATTHIAS A , CHEN DONG , COTEUS PAUL W , GARA ALAN G , GIAMPAPA MARK E , HEIDELBERGER PHILIP , HOENICKE DIRK , OHNMACHT MARTIN
IPC: G06F11/10 , G06F12/00 , G06F9/46 , G06F9/52 , G06F11/00 , G06F11/20 , G06F12/02 , G06F12/08 , G06F12/10 , G06F13/00 , G06F13/24 , G06F13/38 , G06F15/173 , G06F15/177 , G06F15/80 , G06F17/14 , H04L1/00 , H04L7/02 , H04L7/033 , H04L12/28 , H04L12/56 , H04L25/02 , H05K7/20
Abstract: A method and apparatus for managing coherence between two processors of a two processor node of a multi-processor computer system. Generally the present invention relates to a software algorithm that simplifies and significantly speeds the management of cache coherence in a message passing parallel computer, and to hardware apparatus that assists this cache coherence algorithm. The software algorithm uses the opening and closing of put/get windows to coordinate the activated required to achieve cache coherence. The hardware apparatus may be an extension to the hardware address decode, that creates, in the physical memory address space of the node, an area of virtual memory that (a) does not actually exist, and (b) is therefore able to respond instantly to read and write requests from the processing elements.
-
公开(公告)号:CA2438195C
公开(公告)日:2009-02-03
申请号:CA2438195
申请日:2002-02-25
Applicant: IBM
Inventor: COTEUS PAUL W , GARA ALAN G , CHEN DONG , BLUMRICH MATTHIAS A , VRANAS PAVLOS M , TAKKEN TODD E , STEINMACHER-BUROW BURKHARD D , HEIDELBERGER PHILIP , GIAMPAPA MARK E
IPC: G06F11/10 , G06F15/173 , G06F9/46 , G06F9/52 , G06F11/00 , G06F11/20 , G06F12/00 , G06F12/02 , G06F12/08 , G06F12/10 , G06F13/00 , G06F13/24 , G06F13/38 , G06F15/177 , G06F15/80 , G06F17/14 , H04L1/00 , H04L7/02 , H04L7/033 , H04L12/28 , H04L12/54 , H04L12/56 , H04L25/02 , H05K7/20
Abstract: In a massively parallel computing system having a plurality of nodes configured in m multi-dimensions, each node including a computing device, a method for routing packets towards their destination nodes is provided which includes generating at least one of a 2m plurality of compact bit vectors (115, 154) containing information derived from downstream nodes. A multileve l arbitration process (116, 155) in which downstream information stored in the compact vectors, such as link status information and fullness of downstream buffers (130, 140), is used to determine a preferred direction and virtual channel for packet transmission. Preferred direction ranges are encoded and virtual channels are selected by examining the plurality of compact bit vectors (115, 154). This dynamic routing method eliminates the necessity of routing tables, thus enhancing scalability of the switch.
-
公开(公告)号:CA2436395A1
公开(公告)日:2002-09-06
申请号:CA2436395
申请日:2002-02-25
Applicant: IBM
Inventor: GIAMPAPA MARK E , GARA ALAN G , COTEUS PAUL W , TAKKEN TODD E , CHEN DONG
IPC: G06F11/10 , G06F9/46 , G06F9/52 , G06F11/00 , G06F11/20 , G06F12/00 , G06F12/02 , G06F12/08 , G06F12/10 , G06F13/00 , G06F13/24 , G06F13/38 , G06F15/173 , G06F15/177 , G06F15/80 , G06F17/14 , H04L1/00 , H04L7/02 , H04L7/033 , H04L12/28 , H04L12/56 , H04L25/02 , H05K7/20 , G06F15/16
Abstract: In a massively parallel system, a method and apparatus for uniquely assignin g a MAC address(400) to a device encodes the MAC address with a physical location of the device(410). The method and apparatus include configuring device interconnections of the parallel system with physical topological information such as a rack number, a midplane number, a card number, and a chip number. A device or node with a physical location encoded MAC address m ay then be interrogated by location for test, diagnostic, and program loading purposes.
-
公开(公告)号:CA2436395C
公开(公告)日:2011-07-12
申请号:CA2436395
申请日:2002-02-25
Applicant: IBM
Inventor: CHEN DONG , COTEUS PAUL W , GARA ALAN G , GIAMPAPA MARK E , TAKKEN TODD E
IPC: G06F11/10 , G06F15/16 , G06F9/46 , G06F9/52 , G06F11/00 , G06F11/20 , G06F12/00 , G06F12/02 , G06F12/08 , G06F12/10 , G06F13/00 , G06F13/24 , G06F13/38 , G06F15/173 , G06F15/177 , G06F15/80 , G06F17/14 , H04L1/00 , H04L7/02 , H04L7/033 , H04L12/28 , H04L12/56 , H04L25/02 , H05K7/20
Abstract: In a massively parallel system, a method and apparatus for uniquely assigning a MAC address(400) to a device encodes the MAC address with a physical location of the device(410). The method and apparatus include configuring device interconnections of the parallel system with physical topological information such as a rack number, a midplane number, a card number, and a chip number. A device or node with a physical location encoded MAC address may then be interrogated by location for test, diagnostic, and program loading purposes.
-
公开(公告)号:DE60221235T2
公开(公告)日:2008-04-10
申请号:DE60221235
申请日:2002-02-25
Applicant: IBM
Inventor: BARRETT WAYNE M , CHEN DONG , COTEUS PAUL W , GARA ALAN G , JACKSON RORY D , KOPCSAY GERARD V , NATHANSON BEN J , TAKKEN TODD E , VRANAS PAVLOS M
IPC: G06F11/10 , H04L7/00 , G06F9/46 , G06F9/52 , G06F11/00 , G06F11/20 , G06F12/00 , G06F12/02 , G06F12/08 , G06F12/10 , G06F13/00 , G06F13/24 , G06F13/38 , G06F15/173 , G06F15/177 , G06F15/80 , G06F17/14 , H03D3/24 , H04B10/08 , H04L1/00 , H04L7/02 , H04L7/033 , H04L12/28 , H04L12/56 , H04L25/02 , H05K7/20
Abstract: A data capture technique for high speed signaling to allow for optimal sampling of an asynchronous data stream. This technique allows for extremely high data rates and does not require that a clock be sent with the data as is done in source synchronous systems. The present invention also provides a hardware mechanism for automatically adjusting transmission delays for optimal two-bit simultaneous bi-directional (SiBiDi) signaling.
-
公开(公告)号:DE60221235D1
公开(公告)日:2007-08-30
申请号:DE60221235
申请日:2002-02-25
Applicant: IBM
Inventor: BARRETT WAYNE M , CHEN DONG , COTEUS PAUL W , GARA ALAN G , JACKSON RORY D , KOPCSAY GERARD V , NATHANSON BEN J , TAKKEN TODD E , VRANAS PAVLOS M
IPC: G06F11/10 , H04L7/00 , G06F9/46 , G06F9/52 , G06F11/00 , G06F11/20 , G06F12/00 , G06F12/02 , G06F12/08 , G06F12/10 , G06F13/00 , G06F13/24 , G06F13/38 , G06F15/173 , G06F15/177 , G06F15/80 , G06F17/14 , H03D3/24 , H04B10/08 , H04L1/00 , H04L7/02 , H04L7/033 , H04L12/28 , H04L12/56 , H04L25/02 , H05K7/20
Abstract: A data capture technique for high speed signaling to allow for optimal sampling of an asynchronous data stream. This technique allows for extremely high data rates and does not require that a clock be sent with the data as is done in source synchronous systems. The present invention also provides a hardware mechanism for automatically adjusting transmission delays for optimal two-bit simultaneous bi-directional (SiBiDi) signaling.
-
公开(公告)号:CA2437661A1
公开(公告)日:2002-09-06
申请号:CA2437661
申请日:2002-02-25
Applicant: IBM
Inventor: HOENICKE DIRK , BLUMRICH MATTHIAS A , HEIDELBERGER PHILIP , CHEN DONG , TAKKEN TODD E , GIAMPAPA MARK E , GARA ALAN G , COTEUS PAUL W , VRANAS PAVLOS M , STEINMACHER-BUROW BURKHARD D
IPC: G06F11/10 , G06F9/46 , G06F9/52 , G06F11/00 , G06F11/20 , G06F12/00 , G06F12/02 , G06F12/08 , G06F12/10 , G06F13/00 , G06F13/24 , G06F13/38 , G06F15/16 , G06F15/173 , G06F15/177 , G06F15/76 , G06F15/80 , G06F17/14 , H04L1/00 , H04L7/02 , H04L7/033 , H04L12/28 , H04L12/56 , H04L25/02 , H05K7/20 , G06F15/00 , H04M1/64
Abstract: A system and method for enabling high-speed, low-latency global tree communications among processing nodes interconnected according to a tree network structure. The global tree network (100) optimally enables collectiv e reduction operations to be performed during parallel algorithm operations executing in a computer structure having a plurality of the interconnected processing nodes. Router devices (200) are included that interconnect the nodes of the tree via links to facilitate performance of low-latency global processing operations at nodes of the virtual tree and sub-tree structures. The global operations include one or more of: global broadcast operations downstream from a root node (110) to leaf nodes (120) of a virtual tree, global reduction operations upstream from leaf nodes to the root node (110) in the virtual tree, and point-to-point message passing from and any node to th e root node (110) in the virtual tree. One node of the virtual tree network is coupled to and functions as an I/O node for providing I/O functionality with an external system for each node of the virtual tree. The global tree networ k (100) may be configured to provide global barrier and interrupt functionalit y in asynchronous or synchronized manner. Thus, parallel algorithm processing operations, for example,employed in parallel computing systems, may be optimally performed in accordance with certain operating phases of the parallel algorithm operations. When implemented in a massively-parallel supercomputing structure, the global tree network (100) is physically and logically partitionable according to needs of a processing algorithm.
-
-
-
-
-
-
-
-
-