THREAD SCHEDULING OVER COMPUTE BLOCKS FOR POWER OPTIMIZATION

    公开(公告)号:US20240362741A1

    公开(公告)日:2024-10-31

    申请号:US18662337

    申请日:2024-05-13

    CPC classification number: G06T1/20 G06F9/5011 Y02D10/00

    Abstract: One embodiment provides for a general-purpose graphics processing unit comprising a processing array including multiple compute blocks, each compute block including multiple processing clusters and a thread dispatch unit to dispatch threads of a workload to the multiple compute blocks based on a parallelism metric, wherein the thread dispatch unit, based on the parallelism metric, is to perform one of a first operation and a second operation, the first operation to distribute threads across the multiple compute blocks and the second operation is to concentrate threads within one of the multiple compute blocks.

    THREAD SCHEDULING OVER COMPUTE BLOCKS FOR POWER OPTIMIZATION

    公开(公告)号:US20220156875A1

    公开(公告)日:2022-05-19

    申请号:US17527689

    申请日:2021-11-16

    Abstract: Thread dispatch circuitry is configured to dispatch threads of a two-dimensional (2D) thread group based on data access locality associated with the threads. The thread dispatch circuitry can dispatch a first 2D sub-group of the 2D thread group to a compute block of the multiple compute blocks, the first 2D sub-group associated with a first 2D tile of memory and dispatch a second 2D sub-group of the 2D thread group to the compute block of the multiple compute blocks, the second 2D sub-group associated with a second 2D tile of memory.

Patent Agency Ranking