-
公开(公告)号:US11037269B1
公开(公告)日:2021-06-15
申请号:US16832996
申请日:2020-03-27
Applicant: Intel Corporation
Inventor: Travis Schluessler , Zack Waters , Charles Moidel , Michael Apodaca , Murali Ramadoss
Abstract: Described herein are devices, systems and methods to utilize non-volatile memory to save and retrieve data that is used to accelerate the load and resume of GPU accelerated applications. Non-volatile memory and GPU logic are configured to enable the GPU to directly access the non-volatile memory to enable data to be read without requiring the data to traverse the CPU and CPU memory. This data access path creates a faster method for loading data into GPU local memory.
-
公开(公告)号:US10964087B2
公开(公告)日:2021-03-30
申请号:US16546658
申请日:2019-08-21
Applicant: Intel Corporation
Inventor: Devan Burke , Abhishek Venkatesh , Travis Schluessler
Abstract: One embodiment provides a graphics processor comprising a hardware graphics rendering pipeline configured to perform multisample anti-aliasing, the hardware graphics rendering pipeline including pixel processing logic to determine that each sample location of a pixel of a multisample surface is associated with a clear value and resolve a color value for the pixel to a non-multisample surface via a write of metadata to indicate that the pixel has the clear value. The resolve can be a stenciled resolve that automatically bypasses execution of a pixel shader for pixels having clear color data.
-
公开(公告)号:US10937126B2
公开(公告)日:2021-03-02
申请号:US15982680
申请日:2018-05-17
Applicant: Intel Corporation
Inventor: John Gierach , Abhishek Venkatesh , Travis Schluessler , Devan Burke , Tomer Bar-On , Michael Apodaca
Abstract: Embodiments are generally directed to tile-based multiple resolution rendering of images. An embodiment of an apparatus includes one or more processor cores; a plurality of tiling bins, the plurality of tiling bins including a bin for each of a plurality of tiles in an image; and a memory to store data for rendering of an image in one or more of a plurality of resolutions. The apparatus is to generate, in the memory, storage for a resolution setting for each the plurality of tiling bins and storage for a final render target, each tile of the final render target being rendered based on a respective tiling bin in the plurality of tiling bins.
-
公开(公告)号:US10832371B2
公开(公告)日:2020-11-10
申请号:US16236305
申请日:2018-12-28
Applicant: Intel Corporation
Inventor: Michael Doyle , Travis Schluessler , Gabor Liktor , Atsuo Kuwahara , Jefferson Amstutz
Abstract: An apparatus comprising a sorting unit to sort primitives of a graphics image, the primitives to be grouped, each group to form a first level node of a hierarchical acceleration structure; a parallel reconfigurable clustering array to construct the hierarchical acceleration structure, the parallel reconfigurable clustering array comprising a plurality of processing clusters, each cluster comprising: parallel efficiency analysis circuitry to evaluate different groupings of the first level nodes for a next level of the hierarchical acceleration structure to determine efficiency values for the different groupings; and node merge circuitry to merge the first level nodes based on the efficiency values to form second level nodes.
-
公开(公告)号:US20200219223A1
公开(公告)日:2020-07-09
申请号:US16243624
申请日:2019-01-09
Applicant: Intel Corporation
Inventor: Balaji Vembu , Brandon Fliflet , James Valerio , Michael Apodaca , Ben Ashbaugh , Hema Nalluri , Ankur Shah , Murali Ramadoss , David Puffer , Altug Koker , Aditya Navale , Abhishek R. Appu , Joydeep Ray , Travis Schluessler
Abstract: Embodiments described herein provide a graphics, media, and compute device having a tiled architecture composed of a number of tiles of smaller graphics devices. The work distribution infrastructure for such device enables the distribution of workloads across multiple tiles of the device. Work items can be submitted to any one or more of the multiple tiles, with workloads able to span multiple tiles. Additionally, upon completion of a work item, graphics, media, and/or compute engines within the device can readily acquire new work items for execution with minimal latency.
-
公开(公告)号:US10360717B1
公开(公告)日:2019-07-23
申请号:US15858396
申请日:2017-12-29
Applicant: Intel Corporation
Inventor: John G. Gierach , Travis Schluessler , Thomas F. Raoux , Peng Guo
Abstract: An apparatus and method for splitting shaders. For example, one embodiment of a method comprises: receiving a request for compilation of a shader in a graphics processing environment; determining whether there is sufficient work associated with the shader to justify splitting the shader into two or more blocks of program code; evaluating the program code of the shader to identify dependencies between the blocks of program code if there is sufficient work; subdividing the shader into the two or more blocks in accordance with the identified dependencies; and individually executing the two or more blocks of code on a graphics processor. In addition, one embodiment includes the operations of determining whether any of the regions that can be subdivided are likely to run faster with different machine configurations than if the shader is executed without being subdivided, and subdividing the shader only for those regions that are likely to run faster with different machine configurations.
-
公开(公告)号:US12182900B2
公开(公告)日:2024-12-31
申请号:US18231379
申请日:2023-08-08
Applicant: Intel Corporation
Inventor: Michael Doyle , Travis Schluessler , Gabor Liktor , Atsuo Kuwahara , Jefferson Amstutz
Abstract: An apparatus comprising a sorting unit to sort primitives of a graphics image, the primitives to be grouped, each group to form a first level node of a hierarchical acceleration structure; a parallel reconfigurable clustering array to construct the hierarchical acceleration structure, the parallel reconfigurable clustering array comprising a plurality of processing clusters, each cluster comprising: parallel efficiency analysis circuitry to evaluate different groupings of the first level nodes for a next level of the hierarchical acceleration structure to determine efficiency values for the different groupings; and node merge circuitry to merge the first level nodes based on the efficiency values to form second level nodes.
-
公开(公告)号:US11727528B2
公开(公告)日:2023-08-15
申请号:US17724299
申请日:2022-04-19
Applicant: Intel Corporation
Inventor: Michael Doyle , Travis Schluessler , Gabor Liktor , Atsuo Kuwahara , Jefferson Amstutz
CPC classification number: G06T1/20 , G06F9/3877 , G06F9/3891 , G06F9/5077 , G06F16/9027 , G06T15/005 , G06T15/06 , G06T15/10
Abstract: An apparatus comprising a sorting unit to sort primitives of a graphics image, the primitives to be grouped, each group to form a first level node of a hierarchical acceleration structure; a parallel reconfigurable clustering array to construct the hierarchical acceleration structure, the parallel reconfigurable clustering array comprising a plurality of processing clusters, each cluster comprising: parallel efficiency analysis circuitry to evaluate different groupings of the first level nodes for a next level of the hierarchical acceleration structure to determine efficiency values for the different groupings; and node merge circuitry to merge the first level nodes based on the efficiency values to form second level nodes.
-
公开(公告)号:US11688366B2
公开(公告)日:2023-06-27
申请号:US17461228
申请日:2021-08-30
Applicant: Intel Corporation
Inventor: Travis Schluessler , Abhishek Venkatesh , John Gierach , Tomer Bar-On , Devan Burke
IPC: G06T15/40 , G09G5/36 , G09G5/391 , G06T15/20 , G06T7/70 , G06T15/00 , G09G3/00 , G06T1/20 , G09G5/00 , G06T3/00 , G09G5/397 , G06T1/60 , G09G5/377 , G06F3/14
CPC classification number: G09G5/363 , G06T1/20 , G06T7/70 , G06T15/005 , G06T15/20 , G06T15/405 , G09G3/003 , G09G5/001 , G09G5/391 , G06F3/1446 , G06T1/60 , G06T3/0093 , G06T2210/08 , G06T2210/36 , G06T2210/52 , G09G5/377 , G09G5/397 , G09G2300/026 , G09G2320/0252 , G09G2330/023 , G09G2340/0407 , G09G2340/0428 , G09G2352/00 , G09G2354/00 , G09G2360/06 , G09G2360/08 , G09G2360/121 , G09G2360/122
Abstract: Systems, methods and apparatuses may provide for technology to reduce rendering overhead associated with light field displays. The technology may conduct data formatting, re-projection, foveation, tile binning and/or image warping operations with respect to a plurality of display planes in a light field display.
-
公开(公告)号:US11550600B2
公开(公告)日:2023-01-10
申请号:US17090295
申请日:2020-11-05
Applicant: Intel Corporation
Inventor: Li Xu , Haihao Xiang , Feng Chen , Travis Schluessler , Yuheng Zhang , Sen Lin
IPC: G06F9/448 , G06F16/215 , G06T1/60 , G06T1/20
Abstract: Embodiments are generally directed to a system and method for adapting executable object to a processing unit. An embodiment of a method to adapt an executable object from a first processing unit to a second processing unit, comprises: adapting the executable object optimized for the first processing unit of a first architecture, to the second processing unit of a second architecture, wherein the second architecture is different from the first architecture, wherein the executable object is adapted to perform on the second processing unit based on a plurality of performance metrics collected while the executable object is performed on the first processing unit and the second processing unit.
-
-
-
-
-
-
-
-
-