Dynamically Managing Memory Footprint for Tile Based Rendering
    51.
    发明申请
    Dynamically Managing Memory Footprint for Tile Based Rendering 有权
    动态管理基于平铺的渲染的内存占用

    公开(公告)号:US20160275920A1

    公开(公告)日:2016-09-22

    申请号:US14662603

    申请日:2015-03-19

    CPC classification number: G09G5/39 G06T1/60 G06T15/005 G06T15/40

    Abstract: The introduction of an “out-of-memory” marker in the sorted tile geometry sequence for a tile may aid in handling out-of-memory frames. This marker allows hardware to continue rendering using the original data stream instead of the sorted data stream. This enables use of the original data stream allows the system to continue rendering without requiring any driver intervention. During the visibility generation/sorting phase, the number of memory pages required for storing the data for a rendering pass is continuously tracked. This tracking includes tracking the pages that are required even if the hardware had not run out-of-memory. This information can be monitored by a graphics driver and the driver can provide more memory pages for the system to work at full efficiency.

    Abstract translation: 在瓦片的排序瓦片几何序列中引入“内存不足”标记可能有助于处理超出内存的帧。 该标记允许硬件使用原始数据流而不是排序的数据流继续渲染。 这使得能够使用原始数据流允许系统继续呈现而不需要任何驱动器干预。 在可见性生成/分类阶段期间,持续追踪存储渲染通过数据所需的存储器页数。 该跟踪包括跟踪所需的页面,即使硬件没有运行内存不足。 该信息可以由图形驱动程序监视,驱动程序可以为系统提供更多的内存页面,以便全面工作。

    Apparatus and method for ray tracing instruction processing and execution

    公开(公告)号:US12236519B2

    公开(公告)日:2025-02-25

    申请号:US18090810

    申请日:2022-12-29

    Abstract: An apparatus and method to execute ray tracing instructions. For example, one embodiment of an apparatus comprises execution circuitry to execute a dequantize instruction to convert a plurality of quantized data values to a plurality of dequantized data values, the dequantize instruction including a first source operand to identify a plurality of packed quantized data values in a source register and a destination operand to identify a destination register in which to store a plurality of packed dequantized data values, wherein the execution circuitry is to convert each packed quantized data value in the source register to a floating point value, to multiply the floating point value by a first value to generate a first product and to add the first product to a second value to generate a dequantized data value, and to store the dequantized data value in a packed data element location in the destination register.

    Apparatus and method for acceleration data structure refit

    公开(公告)号:US12229870B2

    公开(公告)日:2025-02-18

    申请号:US17982766

    申请日:2022-11-08

    Abstract: Apparatus and method for acceleration data structure refit. For example, one embodiment of an apparatus comprises: a ray generator to generate a plurality of rays in a first graphics scene; a hierarchical acceleration data structure generator to construct an acceleration data structure comprising a plurality of hierarchically arranged nodes including inner nodes and leaf nodes stored in a memory in a depth-first search (DFS) order; traversal hardware logic to traverse one or more of the rays through the acceleration data structure; intersection hardware logic to determine intersections between the one or more rays and one or more primitives within the hierarchical acceleration data structure; a node refit unit comprising circuitry and/or logic to read consecutively through at least the inner nodes in the memory in reverse DFS order to perform a bottom-up refit operation on the hierarchical acceleration data structure.

    Thread modification to reduce command conversion latency

    公开(公告)号:US11948017B2

    公开(公告)日:2024-04-02

    申请号:US16896031

    申请日:2020-06-08

    CPC classification number: G06F9/542 G06F9/546 G06T1/20

    Abstract: Examples described herein relate to a graphics processing apparatus that includes a memory device; and a central processing unit (CPU). In some examples, the CPU is configured to: execute a producer to issue graphics command application program interfaces (APIs); execute a driver to translate graphics command APIs into executable instructions; and based on an idle state of the producer, execute a command translation code segment of the producer to translate graphics command APIs into executable instructions. In some examples, the execution unit is coupled to the memory device, the execution unit to execute one or more of the executable instructions. In some examples, the producer includes multiple portions such as application code, graphics pipeline runtime code, and command translation code segment.

    Mutli-frame renderer
    59.
    发明授权

    公开(公告)号:US11132759B2

    公开(公告)日:2021-09-28

    申请号:US16237987

    申请日:2019-01-02

    Abstract: An embodiment of a graphics command coordinator apparatus may include a commonality identifier to identify a commonality between a first graphics command corresponding to a first frame and a second graphics command corresponding to a second frame, a commonality analyzer communicatively coupled to the commonality identifier to determine if the first graphics command and the second graphics command can be processed together based on the commonality identified by the commonality identifier, and a commonality indicator communicatively coupled to the commonality analyzer to provide an indication that the first graphics command and the second graphics command are to be processed together. Other embodiments are disclosed and claimed.

Patent Agency Ranking