Instruction prefetch mechanism
    21.
    发明授权

    公开(公告)号:US10963389B2

    公开(公告)日:2021-03-30

    申请号:US16787841

    申请日:2020-02-11

    Abstract: An apparatus to facilitate data prefetching is disclosed. The apparatus includes a cache, one or more execution units (EUs) to execute program code, prefetch logic to maintain tracking information of memory instructions in the program code that trigger a cache miss and compiler logic to receive the tracking information, insert one or more pre-fetch instructions in updated program code to prefetch data from a memory for execution of one or more of the memory instructions that triggered a cache miss and download the updated program code for execution by the one or more EUs.

    ENABLING VIRTUAL CALLS IN A SIMD ENVIRONMENT
    25.
    发明申请
    ENABLING VIRTUAL CALLS IN A SIMD ENVIRONMENT 审中-公开
    在SIMD环境中启用虚拟呼叫

    公开(公告)号:US20160162345A1

    公开(公告)日:2016-06-09

    申请号:US14877582

    申请日:2015-10-07

    Abstract: Systems and methods of enabling virtual calls in a single instruction multiple data (SIMD) environment may involve detecting a virtual call of a function and using a single dispatch of the function to invoke the virtual call for two or more channels of the virtual call. In one example, it is determined that the two or more channels share a common target address and a single dispatch of the function is conducted with respect to the common target address. The process may be iterated for additional channels of the virtual call that share a common target address.

    Abstract translation: 在单个指令多数据(SIMD)环境中启用虚拟呼叫的系统和方法可以涉及检测功能的虚拟呼叫,并且使用该功能的单个调度来调用虚拟呼叫的两个或多个信道的虚拟呼叫。 在一个示例中,确定两个或更多个信道共享公共目标地址,并且相对于公共目标地址进行该功能的单个调度。 可以对共享共同目标地址的虚拟呼叫的附加信道重复该过程。

    Multiple register allocation sizes for threads

    公开(公告)号:US12210905B2

    公开(公告)日:2025-01-28

    申请号:US17358650

    申请日:2021-06-25

    Abstract: Provision of multiple register allocation sizes for threads is described. An example of a system includes one or more processors including a graphics processor, the graphics processor including at least a first local thread dispatcher (TDL) and multiple processing resources, each processing resource including a plurality of registers; and memory for storage of data for processing, wherein the one or more processors are to determine a register size for a first thread; identify one or more processing resources having sufficient register space for the first thread; select a processing resource of the one or more processing resources having sufficient register space to assign the first thread; select an available thread slot of the selected processing resource for the first thread; and allocate registers of the selected processing resource for the first thread.

Patent Agency Ranking