Abstract:
A storage array including a local clock buffer with programmable timing provides a mechanism for evaluating circuit timing internal to the storage array. The local clock buffer can independently adjust the pulse width of a local clock that controls the wordline and local bitline precharge pulses and the pulse width of a delayed clock that controls the global bitline precharge, evaulate and read data latching. The delay between the local clock and the delayed clock can also be adjusted. By varying the pulse widths of the local and delayed clock signal, along with the inter-clock delay, the timing margins of each cell in the array can be evaluated by reading and writing the cell with varying pulse width and clock delay. The resulting evaluation can be used to evaluate timing margin variation within a die, as well variaton from die-to-die and under varying environments, e.g., voltage and temperature variation.
Abstract:
A delay circuit has a fixed delay path at a lower voltage level, a level converter, and an adjustable delay path at a higher voltage level. The fixed delay path includes an inverter chain, and the adjustable delay path includes serially-connected delay elements selectively connected to the circuit output. In an application for a local clock buffer of a static, random- access memory (SRAM), the lower voltage level is that of the local clock buffer, and the higher voltage level is that of the SRAM. These voltages may vary in response to dynamic voltage scaling, requiring re-calibration of the adjustable delay path. The adjustable delay path may be calibrated by progressively increasing the read access time of the SRAM array until a contemporaneous read operation returns the correct output, or by using a replica SRAM path to simulate variations in delay with changes in voltage supply.
Abstract:
PROBLEM TO BE SOLVED: To improve a method for evicting a cache line from a loose directory by writing data from a corrected cache line back to the local system memory of a node and evicting an entry from the loose directory later. SOLUTION: First of all, when corrected data are tried to return to the system memory, a correction intended read (RWITM) transaction can be tried again by any reason except the generation of retry by a processor. In this case, the RWITM transaction is forcedly issued again to a transaction reception unit by an AutoRetry mode. Secondly, it is possible for a corrected cache line to be absent in the remote node 10b and RWITM receives an erasure response. Afterwards, the transaction reception unit of a node controller 20b returns a response to an eviction logic and completes the eviction.
Abstract:
PROBLEM TO BE SOLVED: To avoid a lie block by completing any one of two requests in response to a request from the processor of a first node so as to invalidate the remote copy of a cache line stored in a cache memory. SOLUTION: A first request is selected so as to win (namely, Retry is not received yet). When an invalidation request is transferred to a requested remote node by a node controller 19 on the side of home node, a special bit called AutoRetry bit is set in a transaction. Read (RWITM) intending a correction from the home node 11 is repeated on the side of remote node 12 and Retry is locally performed. When Dclaim request from the remote node 12 is tried again back to a processor 24a, the opportunity to efficiently complete the RWITM request from the home node 11 on a local bus in the remote node 12 is increased.
Abstract:
PROBLEM TO BE SOLVED: To avoid lie block conditions by completing write back on the side of home node without retry only in the case of write back from the owned node of a corrected cache line by a coherence directory in the home node. SOLUTION: The cache coherence directory in a node controller 19 on the side of node 11 can be constructed so as to fetch back a cache line from the owned node for a non-processed request and to simultaneously discriminate time to receive a write back kill (WBC request) from the owned node of the cache line. A READ request is issued again by a processor 14a on the side of home node 11, a system memory 17 supplies effective data and the READ request is efficiently completed by a clean response. The lie block conditions can be avoided. Since the system memory 17 is updated into effective data by the WBC request, data can be supplied.
Abstract:
PROBLEM TO BE SOLVED: To avoid the loss of data and coherence by respectively providing first and second nodes with a local mutual connection, a system memory coupled with the local mutual connection and a node controller located between the local mutual connections. SOLUTION: A READ request in a home node 10a is transferred to a remote node 10b where corrected data are resident. Concerning the READ request, the boat of correction/intervention is received by the remote node 10b. The remote node 10b returns the corrected/intervened boat and the corrected data to the home node 10a. A node controller 20 generates a new tag and issue the write back kill (WBC request) of R bit = 1. A coherence mechanism in the node controller 20 completes the WBC request like ReRum of locally generated WBC requests. In this case, data in a local memory 18 are made effective.
Abstract:
PROBLEM TO BE SOLVED: To reduce thermal stress between a printed circuit board and an integrated circuit package by enabling a convection cooling means in response to an indication of high power state and disabling the convection cooling means in response to an indication of low power state. SOLUTION: A ball grid array packaged integrated circuit 2 is fixed to a printed board 1 and an array of solder balls 3 is connected with an integrated circuit chip 6 through a ceramic package base 4. A heat sink assembly 7 is fixed onto the packaged integrated circuit while being fitted tightly to the integrated circuit 6. The heat sink assembly 7 can be subjected to convection cooling with an air flow 8 upon driving a fan 9. A control switch 11 feeds power to the fan 9 and another fan in the housing of a computer system. The switch 11 enables or disables the fan 9 in response a RUN/SLEEP power management signal being fed on a line 12.
Abstract:
A storage array is provided with a local clock buffer 18 which produces local clock signal LCLK and delayed local clock signal DELLCLK both with adjustable tunings based upon global clock signal GLK. The adjustment of the local clock signals is provided by control signal(s) ADJUST input into the array. The ADJUST control signal(s) comprise at least one, preferably three, value(s) for setting at least a timing of the local clock signal(s). The local clock buffer can independently adjust the pulse width of a local clock that controls the wordline and local bitline precharge pulses and the pulse width of a delayed clock that controls the global bitline precharge, evaluate and read data latching. The delay between the local clock and the delayed clock can also be adjusted. By varying the pulse widths of the local and delayed clock signal, along with the inter-clock delay, the timing margins of each cell in the array can be evaluated by reading and writing the cell with varying pulse width and clock delay. There is also provided a mechanism and method for evaluating circuit timing internal to the storage array by varying the control value(s) until a point of failure is reached. The resulting evaluation can be used to evaluate timing margin variation within a die, as well variation from die-to-die and under varying environments, e.g., voltage and temperature variation.
Abstract:
A non-uniform memory access (NUMA) data processing system includes a node interconnect to which at least a first processing node and a second processing node are coupled. The first and the second processing nodes each include a local interconnect, a processor coupled to the local interconnect, a system memory coupled to the local interconnect, and a node controller interposed between the local interconnect and the node interconnect. In order to reduce communication latency, the node controller of the first processing node speculatively transmits request transactions received from the local interconnect of the first processing node to the second processing node via the node interconnect. In one embodiment, the node controller of the first processing node subsequently transmits a status signal to the node controller of the second processing node in order to indicate how the request transaction should be processed at the second processing node.
Abstract:
A power amplifier having a power MOS transistor output device. The gate drive for the power device is a bidirectional current source. In one form of the gate driver circuit, the bidirectional current source includes the capability of controlling the limits of the gate current, which in turn controls the slew rate of the power amplifier.