VIRTUAL ADDRESS ACCESS TO GPU SURFACE AND SAMPLER STATES

    公开(公告)号:US20240134527A1

    公开(公告)日:2024-04-25

    申请号:US17971290

    申请日:2022-10-20

    CPC classification number: G06F3/061 G06F3/0655 G06F3/0679 G06T1/60

    Abstract: Embodiments described herein provide a technique to enable access to entries in a surface state or sampler state using 64-bit virtual addresses. One embodiment provides a graphics core that includes memory access circuitry configured to facilitate access to the memory by functional units of the graphics core. The memory access circuitry is configured to receive a message to access an entry in a surface state or a sampler state associated with a parallel processing operation. The message specifies a base address for a surface state entry or sampler state entry. The circuitry can add the base address and the offset to determine a 64-bit virtual address for the entry in the surface state entry or the sampler state and submit a memory access request to the memory to access the entry of the surface state or sampler state.

    MERGING ATOMICS TO THE SAME CACHE LINE
    16.
    发明公开

    公开(公告)号:US20240087077A1

    公开(公告)日:2024-03-14

    申请号:US17944542

    申请日:2022-09-14

    CPC classification number: G06T1/60 G06T1/20

    Abstract: Embodiments described herein provide a technique to merge partial cache line writes to a cache memory. One embodiment provides a graphics processor comprising a graphics core, a cache coupled with the graphics core, and memory access circuitry to process memory access messages received from the graphics core. The memory access circuitry includes partial cache line write merge circuitry configured to merge a first partial write to a cache line of the cache with a second partial write to the cache line of the cache.

    Apparatus and method for throttling a ray tracing pipeline

    公开(公告)号:US11915357B2

    公开(公告)日:2024-02-27

    申请号:US16820483

    申请日:2020-03-16

    CPC classification number: G06T15/005 G06T15/06

    Abstract: Apparatus and method for stack throttling. For example, one embodiment of an apparatus comprises: execution circuitry comprising a plurality of functional units to execute a plurality of ray shaders and generate a plurality of primary rays and a corresponding plurality of ray messages; a first in first out (FIFO) buffer to queue the ray messages generated by the EUs; a cache to store one or more of the plurality of primary rays; a memory-backed stack to store a first subset of the plurality of ray messages in a corresponding plurality of entries; memory-backed stack management circuitry to either store a second subset of the plurality of ray messages to the memory-backed stack, or to temporarily store the one or more the second subset of the plurality of ray messages to a memory subsystem based, at least in part, on a number of entries currently occupied by ray messages in the memory-backed stack; and ray traversal circuitry to read a next ray message from the memory-backed stack, retrieve a next primary ray identified by the ray message from the cache or a memory subsystem, and perform traversal operations on the next primary ray.

Patent Agency Ranking