THREAD SCHEDULING OVER COMPUTE BLOCKS FOR POWER OPTIMIZATION

    公开(公告)号:US20200327635A1

    公开(公告)日:2020-10-15

    申请号:US16714862

    申请日:2019-12-16

    Abstract: One embodiment provides for a general-purpose graphics processing unit comprising a processing array including multiple compute blocks, each compute block including multiple processing clusters and a thread dispatch unit to dispatch threads of a workload to the multiple compute blocks based on a parallelism metric, wherein the thread dispatch unit, based on the parallelism metric, is to perform one of a first operation and a second operation, the first operation to distribute threads across the multiple compute blocks and the second operation is to concentrate threads within one of the multiple compute blocks.

    SYSTEM AND METHOD TO SUPPORT MULTIPLE WALKERS PER COMMAND

    公开(公告)号:US20200286201A1

    公开(公告)日:2020-09-10

    申请号:US16297129

    申请日:2019-03-08

    Abstract: Embodiments described herein provide an apparatus comprising a processor to configure a plurality of contexts of a command engine to execute a graphics workload comprising a plurality of walkers, allocate, from a pool of execution units of a graphics processor, a subset of execution units to each walker in the plurality of walkers based at least in part on the predetermined number of walkers configured for the context, for each context in the plurality of contexts, dispatch one or more walkers of the plurality of walkers to the execution units, and upon dispatch of the one or more walkers of the plurality of walkers, write an opcode to a computer-readable memory indicating that the dispatch of the walker is complete, wherein the opcode comprises dependency data for the one or more walkers of the plurality of walkers. Other embodiments may be described and claimed.

Patent Agency Ranking