Abstract:
The present provides an instruction removing mechanism and a method using the same. The instruction removing mechanism is capable of scanning a graphic program to determine whether there is any simple texture load instruction (texld instruction) in the program. The simple texld instructions will be transmitted directly to the texture unit and deleted from a texld instruction collector to prevent the pixel shader executing the simple texld instructions before the texture unit.
Abstract:
An instruction folding mechanism, a method for performing the instruction folding mechanism and a pixel processing system employing the instruction folding mechanism are described. The pixel processing system comprises an instruction folding mechanism and a pixel shader. The instruction folding mechanism folds a plurality of first instructions in a first program to generate a second program having at least one second instruction which is a combination of the first instructions. The pixel shader connected to the instruction folding mechanism fetches the second program to decode at least the second instruction having the combination of the first instructions to execute the second program. The instruction folding mechanism comprises an instruction scheduler, a folding rule checker, and an instruction combiner. The instruction scheduler connected to the folding rule checker is used to scan the first instructions according to static positions in order to schedule the first instructions in the first program. The folding rule checker checks the first instructions according to a folding rule whether the first instructions has data independency. The instruction combiner connected to the folding rule checker can combine the first instructions having the data independency to generate at least the second instruction.
Abstract:
An early retiring instruction mechanism, a method for performing the early retiring instruction mechanism and a pixel processing system employing the early retiring instruction mechanism applied to a graphic processor unit (GPU) are described. The pixel processing system comprises an early retiring instruction mechanism and a pixel shader. The early retiring instruction mechanism selectively retires a plurality of instructions in a first program in order to generate at least one early retiring instruction in a second program. The pixel shader is connected to the early retiring instruction mechanism. The pixel shader fetches the second program and decodes at least one early retiring instruction to execute the second program therein for processing a plurality of pixels. Then, the pixel shader checks whether the pixels in the process of the early retiring instruction generated from early retiring instruction mechanism are directly issued to leave the pixel shader in advance. The early retiring instruction is an explicit retiring instruction, a retiring flow-control instruction or an instruction having a retire bit.
Abstract:
A register-collecting mechanism and method using the same for multi-threaded processors are described. The register-collecting mechanism includes an instruction scanner, a register mapping table, an instruction modifier and an indication reporter. The instruction scanner scans one or more first programs having a plurality of first instructions and decode each of the first instructions to extract a plurality of nominal register numbers from the first instructions. The register mapping table compares the nominal register numbers of the first instructions to determine whether to collect a plurality of physical register numbers in sequence of register numbers when at least one of the nominal register numbers is unmapped with respective physical register number previously stored within the register mapping table. The instruction modifier is able to correct the nominal register numbers to generate a second program having a plurality of second instructions which are composed of the sequential physical register numbers collected in the register mapping table.
Abstract:
An instruction folding mechanism, a method for performing the instruction folding mechanism and a pixel processing system employing the instruction folding mechanism are described. The pixel processing system comprises an instruction folding mechanism and a pixel shader. The instruction folding mechanism folds a plurality of first instructions in a first program to generate a second program having at least one second instruction which is a combination of the first instructions. The pixel shader connected to the instruction folding mechanism fetches the second program to decode at least the second instruction having the combination of the first instructions to execute the second program. The instruction folding mechanism comprises an instruction scheduler, a folding rule checker, and an instruction combiner. The instruction scheduler connected to the folding rule checker is used to scan the first instructions according to static positions in order to schedule the first instructions in the first program. The folding rule checker checks the first instructions according to a folding rule whether the first instructions has data independency. The instruction combiner connected to the folding rule checker can combine the first instructions having the data independency to generate at least the second instruction.
Abstract:
An instruction folding mechanism, a method for performing the instruction folding mechanism and a pixel processing system employing the instruction folding mechanism are described. The pixel processing system comprises an instruction folding mechanism and a pixel shader. The instruction folding mechanism folds a plurality of first instructions in a first program to generate a second program having at least one second instruction which is a combination of the first instructions. The pixel shader connected to the instruction folding mechanism fetches the second program to decode at least the second instruction having the combination of the first instructions to execute the second program. The instruction folding mechanism comprises an instruction scheduler, a folding rule checker, and an instruction combiner. The instruction scheduler connected to the folding rule checker is used to scan the first instructions according to static positions in order to schedule the first instrictions in the first program. The folding rule checker checks the first instructions according to a folding rule whether the first instructions has data independency. The instruction combiner connected to the folding rule checker can combine the first instructions having the data independency to generate at least the second instruction.