-
公开(公告)号:US20190163583A1
公开(公告)日:2019-05-30
申请号:US15823313
申请日:2017-11-27
Applicant: Intel Corporation
Inventor: Bahaa Fahim , Swadesh Choudhary , Rahul Pal , Vedaraman Geetha
Abstract: Operational faults, including transient faults, are detected within computing hardware for mission-critical applications. Operational requests received from a requestor node are to be processed by shared agents to produce corresponding responses. A first request is duplicated to be redundantly processed independently and asynchronously by distinct shared agents to produce redundant counterpart responses including a first redundant response and a second redundant response. The first redundant response is compared against the second redundant response. In response to a match, the redundant responses are merged to produce a single final response to the first request to be read by the requestor node. In response to a non-match, an exception response is performed.
-
公开(公告)号:US20180173533A1
公开(公告)日:2018-06-21
申请号:US15383832
申请日:2016-12-19
Applicant: Intel Corporation
Inventor: Niranjan K. Soundararajan , Sreenivas Subramoney , Rahul Pal , Ragavendra Natarajan
IPC: G06F9/38
Abstract: A processor may include a baseline branch predictor and an empirical branch bias override circuit. The baseline branch predictor may receive a branch instruction associated with a given address identifier, and generate, based on a global branch history, an initial prediction of a branch direction for the instruction. The empirical branch bias override circuit may determine, dependent on a direction of an observed branch direction bias in executed branch instruction instances associated with the address identifier, whether the initial prediction should be overridden, may determine, in response to determining that the initial prediction should be overridden, a final prediction that matches the observed branch direction bias, or may determine, in response determining that the initial prediction should not be overridden, a final prediction that matches the initial prediction. The predictor may update an entry in the global branch history reflecting the resolved branch direction for the instruction following its execution.
-
公开(公告)号:US09940238B2
公开(公告)日:2018-04-10
申请号:US15602603
申请日:2017-05-23
Applicant: INTEL CORPORATION
Inventor: Rahul Pal , Ishwar Agarwal , Manoj K. Arora
IPC: G06F12/08 , G06F12/084
CPC classification number: G06F12/084 , G06F2212/2542 , G06F2212/271
Abstract: A chip multiprocessor may include a first cluster and a second cluster, each having multiple cores of a processor, multiple co-located cache slices, and a memory controller. The processor stores directory information in a memory to indicate cluster cache ownership of a first address space to the first cluster. In response to a request to change the cluster cache ownership of the first address space to a second address space of the second cluster, the processor provides a quiesce period during which to block new read or write requests to the first cluster and the second cluster; drain read or write requests issued on the first cluster and the second cluster; and remove the block on new read or write requests. The processor may also update the directory information to change the cluster cache ownership of the first address space to the second address space of the second cluster.
-
公开(公告)号:US12294368B2
公开(公告)日:2025-05-06
申请号:US17485119
申请日:2021-09-24
Applicant: Intel Corporation
Inventor: Rahul Pal , Dheeraj Subbareddy , Mahesh Kumashikar , Dheemanth Nagaraj , Rajesh Vivekanandham , Anshuman Thakur , Ankireddy Nalamalpu , Md Altaf Hossain , Atul Maheshwari
IPC: H03K19/177 , G06F15/78 , G06F30/34 , H03K19/17758 , H03K19/17796 , H03K19/08
Abstract: The present disclosure is directed to 3-D stacked architecture for Programmable Fabrics and Central Processing Units (CPUs). The 3-D stacked orientation enables reconfigurability of the fabric, and allows the fabric to function using coarse-grained and fine-grained acceleration for offloading CPU processing. Additionally, the programmable fabric may be able to function to interface with multiple other compute chiplet components in the 3-D stacked orientation. This enables multiple compute components to communicate without the need for offloading the data communications between the compute chiplets.
-
公开(公告)号:US20210318980A1
公开(公告)日:2021-10-14
申请号:US17359321
申请日:2021-06-25
Applicant: Intel Corporation
Inventor: Rahul Pal , Nayan Amrutlal Suthar , David M. Puffer , Ashok Jagannathan
IPC: G06F13/42 , G06F12/0815 , G06T1/20
Abstract: A processor unit comprising a first controller to couple to a host processing unit over a first link; a second controller to couple to a second processor unit over a second link, wherein the second processor unit is to couple to the host central processing unit via a third link; and circuitry to determine whether to send a cache coherent request to the host central processing unit over the first link or over the second link via the second processing unit.
-
公开(公告)号:US20190220284A1
公开(公告)日:2019-07-18
申请号:US15870595
申请日:2018-01-12
Applicant: Intel Corporation
Inventor: Saurabh Gupta , Rahul Pal , Niranjan Soundararajan , Ragavendra Natarajan , Sreenivas Subramoney
IPC: G06F9/38
CPC classification number: G06F9/3844 , G06F9/3806 , G06F9/3859 , G06F9/3861
Abstract: One embodiment provides an apparatus. The apparatus includes a store direct dependent (SDD) branch prediction circuitry and an SDD management circuitry. The store direct dependent (SDD) branch prediction circuitry is to store an SDD branch table. The SDD branch table is to store at least one record. Each record includes a branch instruction pointer (IP) field, a load IP field, a store IP field, a comparison info field and at least one of a store value field and/or a predicted outcome field. The SDD management circuitry is to populate the SDD branch table at runtime and to override a baseline branch prediction associated with an incoming branch IP with an SDD branch prediction, if the SDD branch table contains a first record populated with the incoming branch IP and at least one of a store value and/or an SDD predicted outcome.
-
公开(公告)号:US20170269959A1
公开(公告)日:2017-09-21
申请号:US15070146
申请日:2016-03-15
Applicant: Intel Corporation
Inventor: Ishwar Agarwal , Eric R. Wehage , David M. Lee , Swadesh Choudhary , Rahul Pal
CPC classification number: G06F13/4068 , G06F13/16
Abstract: In one embodiment, an apparatus comprises: an encoder to receive a non-posted transaction from a requester and encode information of the non-posted transaction into an encoded transaction identifier having a predetermined root bus identifier reserved for non-posted transactions; and a first transmitter to send the non-posted transaction including the encoded transaction identifier to a fabric, to enable the non-posted transaction to be routed to a destination. Other embodiments are described and claimed.
-
公开(公告)号:US09690706B2
公开(公告)日:2017-06-27
申请号:US14668831
申请日:2015-03-25
Applicant: Intel Corporation
Inventor: Rahul Pal , Ishwar Agarwal , Manoj K. Arora
IPC: G06F12/08 , G06F12/084
CPC classification number: G06F12/084 , G06F2212/2542 , G06F2212/271
Abstract: Resolving coherency issues inherent in sharing distributed cache is described. A chip multiprocessor may include at least first and second processing clusters, each having multiple cores of a processor, multiple cache slices co-located with the multiple cores, and a memory controller (MC). The processor stores directory information in a memory coupled to the processor to indicate cluster cache ownership of a first address space to the first cluster. In response to a request to change the cluster cache ownership of the first address space, the processor may remap first lines of first cache slices, corresponding to the first address space, to second lines in second cache slices of the second cluster, and update the directory information (e.g., a state of the first cache lines) to change the cluster cache ownership of the first address space to the second cluster. One of the MCs may manage such updating of the directory.
-
公开(公告)号:US09552308B2
公开(公告)日:2017-01-24
申请号:US14126919
申请日:2013-09-30
Applicant: Intel Corporation
Inventor: Suresh Sugumar , Mahesh K. Kumashikar , Rahul Pal , Sridhar Muthrasanallur
CPC classification number: G06F13/00 , G06F1/3206 , G06F9/46 , G06F13/4273 , Y02D10/14 , Y02D10/151
Abstract: A request associated with a particular cache record is generated to be sent to a system component associated with a cache bank over an interconnect. A wake-warn signal is sent over a dedicated wake-warn channel to indicate to the system component that the request is to arrive. Wake-warn signals cause a disabled clock to be ungated to an enabled state. The request is then sent to the system component.
Abstract translation: 生成与特定高速缓存记录相关联的请求,以通过互连发送到与高速缓存组相关联的系统组件。 唤醒警告信号通过专用的唤醒通道发送,以向系统组件指示请求到达。 唤醒信号导致禁用的时钟被禁止到启用状态。 然后将请求发送到系统组件。
-
公开(公告)号:US20240031308A1
公开(公告)日:2024-01-25
申请号:US18478755
申请日:2023-09-29
Applicant: Intel Corporation
Inventor: Rahul Pal , Ashish Gupta , Keong Hong Oh , Gia Thuyet Ngo , Vikrant Kapila , Ankita Roy
IPC: H04L49/109
CPC classification number: H04L49/109
Abstract: An integrated circuit includes a core region of logic circuits and a network routed outside the core region. The network includes a wide layer and a narrow layer. The wide layer comprises first routers coupled in series. The narrow layer comprises second routers coupled in series.
-
-
-
-
-
-
-
-
-