-
公开(公告)号:US20230062540A1
公开(公告)日:2023-03-02
申请号:US17405957
申请日:2021-08-18
Applicant: Intel Corporation
Inventor: Prasoonkumar SURTI , Abhishek R. APPU , Karol A. SZERSZEN , Karthik VAIDYANATHAN , Sreenivas KOTHANDARAMAN , Mohamed FAROOK
Abstract: Examples described herein relate to a manner of determining a number of bits to encode compression data. Some examples include: compressing pixel data of a region of pixels in a frame; determining a number of bits associated with at least two partitions; utilizing the determined number of bits to encode residual values generated from the compressing the pixel data; and storing the encoded residual values. In some examples, the at least two partitions comprise a first partition and a second partition. Some examples include: encoding residuals in the first partition using a number of bits associated with the first partition and encoding residuals in the second partition using a number of bits associated with the second partition. Some examples include: determining a distribution of bins of residuals, wherein each different bin represents a number of bits used to encode a residual value and determining a midpoint of a total number of residuals as a bin that stores a residual that is approximately 50 percentile of the total number of residuals of the distribution.
-
公开(公告)号:US20230030741A1
公开(公告)日:2023-02-02
申请号:US17390661
申请日:2021-07-30
Applicant: Intel Corporation
Inventor: Nilay MISTRY , Karol A. SZERSZEN , Prasoonkumar SURTI , Ronald W. SILVAS
Abstract: Compressed verbatim copy can enable more efficient copying of compressed data. In one example, a compressed verbatim copy method involves receiving a command to copy compressed data from a source address of the memory device to a destination address. In response to the receipt of the command, the method involves copying the compressed data in a compressed format from the source address to the destination address without first decompressing the data. A second source address and a second destination address of metadata for the compressed data is determined, and the metadata is copied from the second source address to the second destination address.
-
公开(公告)号:US20220262063A1
公开(公告)日:2022-08-18
申请号:US17677109
申请日:2022-02-22
Applicant: INTEL CORPORATION
Inventor: Karol SZERSZEN , Prasoonkumar SURTI , Gabor LIKTOR , Karthik VAIDYANATHAN , Sven WOOP
Abstract: Apparatus and method for grouping rays based on quantized ray directions. For example, one embodiment of an apparatus comprises: An apparatus comprising: a ray generator to generate a plurality of rays; ray direction evaluation circuitry/logic to generate approximate ray direction data for each of the plurality of rays; ray sorting circuitry/logic to sort the rays into a plurality of ray queues based, at least in part, on the approximate ray direction data.
-
公开(公告)号:US20220138894A1
公开(公告)日:2022-05-05
申请号:US17521009
申请日:2021-11-08
Applicant: Intel Corporation
Inventor: Prasoonkumar SURTI , Larry SEILER , Adam Z. LEIBEL
Abstract: Systems and methods may provide for receiving a pixel shader and sending the pixel shader to shader bypass hardware if the pixel shader and a render target associated with the pixel shader satisfy a simplicity condition. In one example, the shader bypass hardware is dedicated to pixel shaders and associated render targets that satisfy the simplicity condition.
-
公开(公告)号:US20210149680A1
公开(公告)日:2021-05-20
申请号:US17095585
申请日:2020-11-11
Applicant: Intel Corporation
Inventor: Christopher J. HUGHES , Prasoonkumar SURTI , Guei-Yuan LUEH , Adam T. LAKE , Jill BOYCE , Subramaniam MAIYURAN , Lidong XU , James M. HOLLAND , Vasanth RANGANATHAN , Nikos KABURLASOS , Altug KOKER , Abhishek R. APPU
IPC: G06F9/38 , G06F12/084 , G06F9/54 , G06F9/50 , G06T1/60
Abstract: Embodiments described herein provide an apparatus comprising a plurality of processing resources including a first processing resource and a second processing resource, a memory communicatively coupled to the first processing resource and the second processing resource, and a processor to receive data dependencies for one or more tasks comprising one or more producer tasks executing on the first processing resource and one or more consumer tasks executing on the second processing resource and move a data output from one or more producer tasks executing on the first processing resource to a cache memory communicatively coupled to the second processing resource. Other embodiments may be described and claimed.
-
6.
公开(公告)号:US20200310809A1
公开(公告)日:2020-10-01
申请号:US16366155
申请日:2019-03-27
Applicant: Intel Corporation
Inventor: Christopher J. HUGHES , Jonathan D. PEARCE , Guei-Yuan LUEH , ElMoustapha OULD-AHMED-VALL , Jorge E. PARRA , Prasoonkumar SURTI , Krishna N. VINOD , Ronen ZOHAR
IPC: G06F9/30
Abstract: Embodiments detailed herein relate to reduction operations on a plurality of data element values. In one embodiment, a process comprises decoding circuitry to decode an instruction and execution circuitry to execute the decoded instruction. The instruction specifies a first input register containing a plurality of data element values, a first index register containing a plurality of indices, and an output register, where each index of the plurality of indices maps to one unique data element position of the first input register. The execution includes to identify data element values that are associated with one another based on the indices, perform one or more reduction operations on the associated data element values based on the identification, and store results of the one or more reduction operations in the output register.
-
公开(公告)号:US20200310804A1
公开(公告)日:2020-10-01
申请号:US16370922
申请日:2019-03-30
Applicant: Intel Corporation
Inventor: Christopher J. HUGHES , ElMoustapha OULD-AHMED-VALL , Jorge E. PARRA , Prasoonkumar SURTI , Krishna N. VINOD , Ronen ZOHAR
Abstract: Methods and apparatus for vector-matrix comparison are disclosed. In one embodiment, a processor comprises decoding and execution circuitry. The decoding circuitry decodes an instruction, where operands of the instruction specifies an output location to store output results, a vector of data element values, and a matrix of data element values. The execution circuitry executes the decoded instruction. The execution includes to map each of the data element values of the vector to one of consecutive rows of the matrix; for each data element value of the vector, to compare that data element value of the vector with data element values in a respective row of the matrix and obtain data element match results. The execution further includes to store the output results based on the data element match results, where each output result maps to a respective data element column position and indicates a vector match result.
-
公开(公告)号:US20250103343A1
公开(公告)日:2025-03-27
申请号:US18955259
申请日:2024-11-21
Applicant: INTEL CORPORATION
Inventor: Christopher J. HUGHES , Prasoonkumar SURTI , Guei-Yuan LUEH , Adam T. LAKE , Jill BOYCE , Subramaniam MAIYURAN , Lidong XU , James M. HOLLAND , Vasanth RANGANATHAN , Nikos KABURLASOS , Altug KOKER , Abhishek R. Appu
IPC: G06F9/38 , G06F9/50 , G06F9/54 , G06F12/084 , G06T1/60
Abstract: Embodiments described herein provide an apparatus comprising a plurality of processing resources including a first processing resource and a second processing resource, a memory communicatively coupled to the first processing resource and the second processing resource, and a processor to receive data dependencies for one or more tasks comprising one or more producer tasks executing on the first processing resource and one or more consumer tasks executing on the second processing resource and move a data output from one or more producer tasks executing on the first processing resource to a cache memory communicatively coupled to the second processing resource. Other embodiments may be described and claimed.
-
公开(公告)号:US20240004713A1
公开(公告)日:2024-01-04
申请号:US18363339
申请日:2023-08-01
Applicant: Intel Corporation
Inventor: Abhishek R. APPU , Altug KOKER , Balaji VEMBU , Joydeep RAY , Kamal SINHA , Prasoonkumar SURTI , Kiran C. VEERNAPU , Subramaniam MAIYURAN , Sanjeev S. Jahagirdar , Eric J. Asperheim , Guei-Yuan Lueh , David Puffer , Wenyin Fu , Nikos Kaburlasos , Bhushan M. Borole , Josh B. Mastronarde , Linda L. Hurd , Travis T. Schluessler , Tomasz Janczak , Abhishek Venkatesh , Kai Xiao , Slawomir Grajewski
CPC classification number: G06F9/5016 , G06F9/5044 , G06F1/329 , G06F9/4893 , G06T1/20 , G06T1/60 , G06T15/005 , Y02D10/00 , G06T2200/28
Abstract: In an example, an apparatus comprises a plurality of execution units comprising at least a first type of execution unit and a second type of execution unit and logic, at least partially including hardware logic, to analyze a workload and assign the workload to one of the first type of execution unit or the second type of execution unit. Other embodiments are also disclosed and claimed.
-
公开(公告)号:US20220343554A1
公开(公告)日:2022-10-27
申请号:US17740754
申请日:2022-05-10
Applicant: INTEL CORPORATION
Inventor: Carson BROWNLEE , Carsten BENTHIN , Joshua BARCZAK , Kai XIAO , Michael APODACA , Prasoonkumar SURTI , Thomas RAOUX
Abstract: Apparatus and method for context-aware compression. For example, one embodiment of an apparatus comprises: ray traversal/intersection circuitry to traverse rays through a hierarchical acceleration data structure to identify intersections between rays and primitives of a graphics scene; matrix compression circuitry/logic to compress hierarchical transformation matrices to generate compressed hierarchical transformation matrices by quantizing N-bit floating point data elements associated with child transforms of the hierarchical transformation matrices to variable-bit floating point numbers or integers comprising offsets from a parent transform of the child transform; and an instance processor to generate a plurality of instances of one or more base geometric objects in accordance with the compressed hierarchical transformation matrices.
-
-
-
-
-
-
-
-
-