-
公开(公告)号:US20200349091A1
公开(公告)日:2020-11-05
申请号:US16881271
申请日:2020-05-22
Applicant: Intel Corporation
Inventor: Abhishek R. Appu , Joydeep Ray , James A. Valerio , Altug Koker , Prasoonkumar Surti , Balaji Vembu , Wenyin Fu , Bhushan M. Borole , Kamal Sinha
IPC: G06F12/128 , G06F12/0811 , G06F13/40 , G06F12/12 , G06T1/60 , G06F12/0897 , G06F12/084
Abstract: A hybrid hierarchical cache is implemented at the same level in the access pipeline, to get the faster access behavior of a smaller cache and, at the same time, a higher hit rate at lower power for a larger cache, in some embodiments. A split cache at the same level in the access pipeline includes two caches that work together. In the hybrid, split, low level cache (e.g., L1) evictions are coordinated locally between the two L1 portions, and on a miss to both L1 portions, a line is allocated from a larger L2 cache to the smallest L1 cache.
-
公开(公告)号:US20200341942A1
公开(公告)日:2020-10-29
申请号:US16847781
申请日:2020-04-14
Applicant: Intel Corporation
Inventor: Joydeep Ray , James A. Valerio , Altug Koker , Abhishek R. Appu , Vasanth Ranganathan
Abstract: A shared local memory data crossbar may be implemented in multiple stages. With this approach, the number of multiplexer cells can be reduced by fifty percent (50%) or more in some embodiments.
-
公开(公告)号:US10796397B2
公开(公告)日:2020-10-06
申请号:US14738679
申请日:2015-06-12
Applicant: INTEL CORPORATION
Inventor: James A. Valerio , Abhishek Venkatesh , Satyajit Sarangi , Michael Apodaca , Thomas F. Raoux , Hashem Hashemi , Rama S. B. Harihara
Abstract: A mechanism is described for facilitating dynamic runtime transformation of graphics processing commands for improved graphics performance on computing devices. A method of embodiments, as described herein, includes detecting a command stream associated with an application, where the command stream includes dispatches. The method may further include evaluating processing parameters relating to each of the dispatches, where evaluating further includes associating a first plan with one or more of the dispatches to transform the command stream into a transformed command stream. The method may further include associating, based on the first plan, a second plan to the one or more of the dispatches, where the second plan represents the transformed command stream. The method may further include executing the second plan, where execution of the second plan includes processing the transformed command stream in lieu of the command stream.
-
公开(公告)号:US10691617B2
公开(公告)日:2020-06-23
申请号:US16113174
申请日:2018-08-27
Applicant: Intel Corporation
Inventor: Abhishek R. Appu , Joydeep Ray , James A. Valerio , Altug Koker , Prasoonkumar P. Surti , Balaji Vembu , Wenyin Fu , Bhushan M. Borole , Kamal Sinha
IPC: G06F12/12 , G06F12/128 , G06F12/0811 , G06F13/40 , G06T1/60 , G06F12/0897 , G06F12/084
Abstract: A hybrid hierarchical cache is implemented at the same level in the access pipeline, to get the faster access behavior of a smaller cache and, at the same time, a higher hit rate at lower power for a larger cache, in some embodiments. A split cache at the same level in the access pipeline includes two caches that work together. In the hybrid, split, low level cache (e.g., L1) evictions are coordinated locally between the two L1 portions, and on a miss to both L1 portions, a line is allocated from a larger L2 cache to the smallest L1 cache.
-
公开(公告)号:US20200111454A1
公开(公告)日:2020-04-09
申请号:US16599175
申请日:2019-10-11
Applicant: Intel Corporation
Inventor: Joydeep Ray , Altug Koker , Balaji Vembu , Murali Ramadoss , Guei-Yuan Lueh , James A. Valerio , Prasoonkumar Surti , Abhishek R. Appu , Vasanth Ranganathan , Kalyan K. Bhiravabhatla , Arthur D. Hunter, JR. , Wei-Yu Chen , Subramaniam M. Maiyuran
IPC: G09G5/36 , G09G5/00 , G06F9/46 , G06F12/0875
Abstract: A mechanism is described for facilitating using of a shared local memory for register spilling/filling relating to graphics processors at computing devices. A method of embodiments, as described herein, includes reserving one or more spaces of a shared local memory (SLM) to perform one or more of spilling and filling relating to registers associated with a graphics processor of a computing device.
-
公开(公告)号:US20190317899A1
公开(公告)日:2019-10-17
申请号:US16394829
申请日:2019-04-25
Applicant: Intel Corporation
Inventor: Altug Koker , Joydeep Ray , James A. Valerio , Abhishek R. Appu , Vasanth Ranganathan
IPC: G06F12/0842 , G06F12/0831 , G06F12/0811 , G06T1/60
Abstract: A mechanism is described for facilitating independent and separate entity-based graphics cache at computing devices. A method of embodiments, as described herein, includes facilitate hosting of a plurality of cache at a plurality of entities associated with a graphics processor, wherein each entity hosts at least one cache, and wherein an entity includes a dual sub-slice (DSS) or a streaming multiprocessor (SM).
-
公开(公告)号:US20180307529A1
公开(公告)日:2018-10-25
申请号:US15493670
申请日:2017-04-21
Applicant: Intel Corporation
Inventor: Altug Koker , Joydeep Ray , Balaji Vembu , James A. Valerio , Abhishek R. Appu
CPC classification number: G06F9/4881 , G06T1/60 , G06T2210/52
Abstract: A mechanism is described for facilitating memory-based software barriers to emulate hardware barriers at graphics processors in computing devices. A method of embodiments, as described herein, includes facilitating converting thread scheduling at a processor from hardware barriers to software barriers, where the software barriers emulate the hardware barriers.
-
公开(公告)号:US10102149B1
公开(公告)日:2018-10-16
申请号:US15488840
申请日:2017-04-17
Applicant: Intel Corporation
Inventor: Abhishek R. Appu , Joydeep Ray , James A. Valerio , Altug Koker , Prasoonkumar P. Surti , Balaji Vembu , Wenyin Fu , Bhushan M. Borole , Kamal Sinha
IPC: G06F12/12 , G06F12/128 , G06F12/0811 , G06F13/40 , G06T1/20
Abstract: A hybrid hierarchical cache is implemented at the same level in the access pipeline, to get the faster access behavior of a smaller cache and, at the same time, a higher hit rate at lower power for a larger cache, in some embodiments. A split cache at the same level in the access pipeline includes two caches that work together. In the hybrid, split, low level cache (e.g., L1) evictions are coordinated locally between the two L1 portions, and on a miss to both L1 portions, a line is allocated from a larger L2 cache to the smallest L1 cache.
-
公开(公告)号:US20180293170A1
公开(公告)日:2018-10-11
申请号:US15483001
申请日:2017-04-10
Applicant: Intel Corporation
Inventor: Altug Koker , Joydeep Ray , James A. Valerio , Abhishek R. Appu , Vasanth Ranganathan
IPC: G06F12/0842
CPC classification number: G06F12/0842 , G06F2212/1008 , G06F2212/455
Abstract: A mechanism is described for facilitating independent and separate entity-based graphics cache at computing devices. A method of embodiments, as described herein, includes facilitate hosting of a plurality of cache at a plurality of entities associated with a graphics processor, wherein each entity hosts at least one cache, and wherein an entity includes a dual sub-slice (DSS) or a streaming multiprocessor (SM).
-
公开(公告)号:US20180011711A1
公开(公告)日:2018-01-11
申请号:US15203907
申请日:2016-07-07
Applicant: Intel Corporation
Inventor: Joydeep Ray , Abhishek R. Appu , James A. Valerio , Bharath Narasimha Swamy
CPC classification number: G06F9/3887 , G06F12/0806 , G06F12/084 , G06F13/1605 , G06F2212/314
Abstract: One embodiment provides for a graphics processor comprising first logic coupled with a first execution unit, the first logic to receive a first single instruction multiple data (SIMD) message from the first execution unit; second logic coupled with a second execution unit, the second logic to receive a second SIMD message from the second execution unit; and third logic coupled with a bank of shared local memory (SLM), the third logic to receive a first request to access the bank of SLM from the first logic, a second request to access the bank of SLM from the second logic, and in a single access cycle, schedule a read access to a read port for the first request and a write access to a write port for the second request.
-
-
-
-
-
-
-
-
-