-
公开(公告)号:US20240338238A1
公开(公告)日:2024-10-10
申请号:US18574849
申请日:2022-01-26
Applicant: Intel Corporation
Inventor: Wei Wang , Kun Tian , Guang Zeng , Gilbert Neiger , Rajesh Sankaran , Asit Mallick , Jr-Shian Tsai , Jacob Jun Pan , Mesut Ergin
CPC classification number: G06F9/45558 , G06F9/3016 , G06F9/45545 , G06F2009/45579
Abstract: A method and system of host to guest (H2G) notification are disclosed. H2G is provided via an instruction. The instruction is a send user inter-processor interrupt instruction. An exemplary processor includes decoder circuitry to decode a single instruction and execute the decoded single instruction according to the at least the opcode to cause a host to guest notification from a virtual device running in a host machine on the first physical processor to a virtual device driver running on a virtual processor in a guest machine on a second physical processor.
-
公开(公告)号:US12045640B2
公开(公告)日:2024-07-23
申请号:US16909084
申请日:2020-06-23
Applicant: Intel Corporation
Inventor: Sanjay K. Kumar , Philip Lantz , Rajesh Sankaran , Narayan Ranganathan , Saurabh Gayen , David A. Koufaty , Utkarsh Y. Kakaiya
CPC classification number: G06F9/45558 , G06F9/342 , G06F9/4875 , G06F12/10 , G06F12/109 , G06F12/1441 , G06F2009/45579 , G06F2009/45583 , G06F12/145 , G06F12/1475 , G06F12/1483 , G06F2212/1016 , G06F2212/152 , G06F2212/656 , G06F2212/657
Abstract: In one embodiment, a data mover accelerator is to receive, from a first agent having a first address space and a first process address space identifier (PASID) to identify the first address space, a first job descriptor comprising a second PASID selector to specify a second PASID to identify a second address space. In response to the first job descriptor, the data mover accelerator is to securely access the first address space and the second address space. Other embodiments are described and claimed.
-
公开(公告)号:US20240232096A9
公开(公告)日:2024-07-11
申请号:US18279029
申请日:2021-03-25
Applicant: Intel Corporation
Inventor: Sanjay Kumar , Phillip Lantz , Rajesh Sankaran , David Hansen , Evgeny V. Voevodin , Andrew Anderson , Lizhen You , Xin Zhou , Nikhil Talpallikar
IPC: G06F12/1009
CPC classification number: G06F12/1009
Abstract: An embodiment of an integrated circuit may comprise an array of hardware counters, and circuitry communicatively coupled to the array of hardware counters, the circuitry to count accesses to one or more selected pages of a memory with the array of hardware counters. Other embodiments are disclosed and claimed.
-
公开(公告)号:US11748130B2
公开(公告)日:2023-09-05
申请号:US16456300
申请日:2019-06-28
Applicant: INTEL CORPORATION
Inventor: Rajesh Sankaran , Bret Toll , William Rash , Subramaniam Maiyuran , Gang Chen , Varghese George
IPC: G06F9/455 , G06F12/1009 , G06T1/20
CPC classification number: G06F9/45558 , G06F12/1009 , G06T1/20 , G06F2009/4557 , G06F2009/45583 , G06F2009/45591
Abstract: Graphics processing systems and methods are described. A graphics processing apparatus may comprise one or more graphics processing engines, a memory, a memory management unit (MMU) including a GPU second level page table and GPU dirty bit tracking, and a provisioning agent to receive a request from a virtual machine monitor (VMM) to provision a subcluster of graphics processing apparatuses, the subcluster including a plurality of graphics processing engines from a plurality of graphics processing apparatuses connected using a scale-up fabric, provision the scale-up fabric to route data within the subcluster of graphics processing apparatuses, and provision a plurality of resources on the graphics processing apparatus for the subcluster based on the request from the VMM.
-
公开(公告)号:US20220398017A1
公开(公告)日:2022-12-15
申请号:US17348586
申请日:2021-06-15
Applicant: Intel Corporation
Inventor: Utkarsh Y. Kakaiya , David Koufaty , Rajesh Sankaran , Vedvyas Shanbhogue
Abstract: An embodiment of an integrated circuit comprises circuitry to share page tables associated with a page between a processor memory management unit (MMU) and an input/output memory management unit (IOMMU), store a page table entry in the memory associated with the page, and separately control access to the page from a processor and from a direct memory access (DMA) request based on one or more fields of the stored page table entry. Other embodiments are disclosed and claimed.
-
公开(公告)号:US11379236B2
公开(公告)日:2022-07-05
申请号:US16728665
申请日:2019-12-27
Applicant: Intel Corporation
Inventor: Pratik Marolia , Rajesh Sankaran
IPC: G11C16/04 , G06F9/30 , G06F9/38 , G06F12/1027
Abstract: An apparatus and method for hybrid software-hardware coherency. An apparatus comprises one or more processing elements to process data; a memory controller to couple the one or more processing elements to a device memory; an interconnect to couple the one or more processing elements to a host processor memory and to couple a host processor to the device memory; one or more device caches to store cache lines read from the host processor memory and/or the device memory; coherency circuitry to manage an ownership indication for each cache line, the ownership indication to be set to a first value to indicate ownership by the host processor and to be set to a second value to indicate ownership by the processing device, wherein the coherency circuitry is to transfer ownership of a first cache line from the processing device to the host processor by updating the ownership indication from the second value to the first value, the coherency circuitry to provide indirect access to the cache line by the processing device while the ownership indication is set to the first value, the coherency circuitry to maintain the ownership indication at the first value until receiving a request to change the ownership indication.
-
公开(公告)号:US11201838B2
公开(公告)日:2021-12-14
申请号:US16582224
申请日:2019-09-25
Applicant: Intel Corporation
Inventor: Pratik Marolia , Rajesh Sankaran , Ishwar Agarwal , Nitish Paliwal
IPC: H04L12/935 , H04L29/06
Abstract: In one embodiment, an input/output port includes a stateful transmit port having: a history storage to store a value corresponding to a transmit on change field of a prior data packet; a comparator to compare a transmit on change field of the data packet to the value stored in the history storage; and a selection circuit to output the data packet without the transmit on change field when the transmit on change field of the data packet matches the value. Other embodiments are described and claimed.
-
公开(公告)号:US20210064525A1
公开(公告)日:2021-03-04
申请号:US16958479
申请日:2018-01-02
Applicant: Intel Corporation
Inventor: Kun Tian , Rajesh Sankaran , Sanjay Kumar , Ashok Raj
IPC: G06F12/06 , G06F12/1081 , G06F12/1036 , G06F12/0891 , G06F9/455
Abstract: A processor includes a hardware input/output (I/O) memory management unit (IOMMU) and a core, which executes an instruction to intercept a payload from a virtual machine (VM). The payload contains a guest bus device function (BDF) identifier, a guest address space identifier (ASID), and a guest address range. The core accesses, within a virtual machine control structure stored in memory, pointers to a first set of translation tables and a second set of translation tables. The core traverses the first set of translation tables to translate the guest BDF identifier to a host BDF identifier and traverses the second set of translation tables to translate the guest ASID to a host ASID. The core stores the host BDF identifier and the host ASID in the payload and submits, to the hardware IOMMU, an administrative command containing the payload to perform invalidation of the guest address range.
-
公开(公告)号:US10817425B2
公开(公告)日:2020-10-27
申请号:US14583389
申请日:2014-12-26
Applicant: Intel Corporation
Inventor: Ren Wang , Andrew J. Herdrich , Yen-cheng Liu , Herbert H. Hum , Jong Soo Park , Christopher J. Hughes , Namakkal N. Venkatesan , Adrian C. Moga , Aamer Jaleel , Zeshan A. Chishti , Mesut A. Ergin , Jr-shian Tsai , Alexander W. Min , Tsung-yuan C. Tai , Christian Maciocco , Rajesh Sankaran
IPC: G06F12/0842 , G06F12/0831 , G06F12/0893 , G06F12/109 , G06F12/0813 , G06F9/455
Abstract: Methods and apparatus implementing Hardware/Software co-optimization to improve performance and energy for inter-VM communication for NFVs and other producer-consumer workloads. The apparatus include multi-core processors with multi-level cache hierarchies including and L1 and L2 cache for each core and a shared last-level cache (LLC). One or more machine-level instructions are provided for proactively demoting cachelines from lower cache levels to higher cache levels, including demoting cachelines from L1/L2 caches to an LLC. Techniques are also provided for implementing hardware/software co-optimization in multi-socket NUMA architecture system, wherein cachelines may be selectively demoted and pushed to an LLC in a remote socket. In addition, techniques are disclosure for implementing early snooping in multi-socket systems to reduce latency when accessing cachelines on remote sockets.
-
公开(公告)号:US10761996B2
公开(公告)日:2020-09-01
申请号:US16147191
申请日:2018-09-28
Applicant: Intel Corporation
Inventor: Vedvyas Shanbhogue , Ravi Sahita , Rajesh Sankaran , Siddhartha Chhabra , Abhishek Basak , Krystof Zmudzinski , Rupin Vakharwala
Abstract: Examples include an apparatus which accesses secure pages in a trust domain using secure lookups in first and second sets of page tables. For example, one embodiment of the processor comprises: a decoder to decode a plurality of instructions including instructions related to a trusted domain; execution circuitry to execute a first one or more of the instructions to establish a first trusted domain using a first trusted domain key, the trusted domain key to be used to encrypt memory pages within the first trusted domain; and the execution circuitry to execute a second one or more of the instructions to associate a first process address space identifier (PASID) with the first trusted domain, the first PASID to uniquely identify a first execution context associated with the first trusted domain.
-
-
-
-
-
-
-
-
-