Instruction prefetch based on thread dispatch commands

    公开(公告)号:US11157283B2

    公开(公告)日:2021-10-26

    申请号:US16243663

    申请日:2019-01-09

    Abstract: A graphics processing device comprises a set of compute units to execute multiple threads of a workload, a cache coupled with the set of compute units, and a prefetcher to prefetch instructions associated with the workload. The prefetcher is configured to use a thread dispatch command that is used to dispatch threads to execute a kernel to prefetch instructions, parameters, and/or constants that will be used during execution of the kernel. Prefetch operations for the kernel can then occur concurrently with thread dispatch operations.

    BARRIER SYNCHRONIZATION MECHANISM
    78.
    发明申请

    公开(公告)号:US20210263785A1

    公开(公告)日:2021-08-26

    申请号:US16798603

    申请日:2020-02-24

    Abstract: An apparatus to facilitate thread barrier synchronization is disclosed. The apparatus includes a plurality of processing resources to execute a plurality of execution threads included in a thread workgroup and barrier synchronization hardware to assign a first named barrier to a first set of the plurality of execution threads in the thread workgroup, assign a second named barrier to a second set of the plurality of execution threads in the thread workgroup, synchronize execution of the first set of execution threads via the first named barrier and synchronize execution of the second set of execution threads via the second named barrier.

Patent Agency Ranking