ISOLATING COMMUNICATION STREAMS TO ACHIEVE HIGH PERFORMANCE MULTI-THREADED COMMUNICATION FOR GLOBAL ADDRESS SPACE PROGRAMS

    公开(公告)号:US20240330084A1

    公开(公告)日:2024-10-03

    申请号:US18525553

    申请日:2023-11-30

    CPC classification number: G06F9/544 G06F9/52

    Abstract: Systems, apparatuses and methods may provide for detecting an outbound communication and identifying a context of the outbound communication. Additionally, a completion status of the outbound communication may be tracked relative to the context. In one example, tracking the completion status includes incrementing a sent messages counter associated with the context in response to the outbound communication, detecting an acknowledgement of the outbound communication based on a network response to the outbound communication, incrementing a received acknowledgements counter associated with the context in response to the acknowledgement, comparing the sent messages counter to the received acknowledgements counter, and triggering a per-context memory ordering operation if the sent messages counter and the received acknowledgements counter have matching values.

    Technologies for Automatic Processor Core Association Management and Communication Using Direct Data Placement in Private Caches

    公开(公告)号:US20170094010A1

    公开(公告)日:2017-03-30

    申请号:US14864369

    申请日:2015-09-24

    CPC classification number: H04L67/2852 G06F9/46

    Abstract: Technologies for communication with direct data placement include a number of computing nodes in communication over a network. Each computing node includes a many-core processor having an integrated host fabric interface (HFI) that maintains an association table (AT). In response to receiving a message from a remote device, the HFI determines whether the AT includes an entry associating one or more parameters of the message to a destination processor core. If so, the HFI causes a data transfer agent (DTA) of the destination core to receive the message data. The DTA may place the message data in a private cache of the destination core. Message parameters may include a destination process identifier or other network address and a virtual memory address range. The HFI may automatically update the AT based on communication operations generated by software executed by the processor cores. Other embodiments are described and claimed.

    TECHNOLOGIES FOR HANDLING MESSAGE PASSING INTERFACE OPERATIONS

    公开(公告)号:US20170085625A1

    公开(公告)日:2017-03-23

    申请号:US14858051

    申请日:2015-09-18

    Abstract: Technologies for handling message passing interface receive operations include a compute node to determine a plurality of parameters of a receive entry to be posted and determine whether the plurality of parameters includes a wildcard entry. The compute node generates a hash based on at least one parameter of the plurality of parameters in response to determining that the plurality of parameters does not include the wildcard entry and appends the receive entry to a list in a bin of a posted receive data structure, wherein the bin is determined based on the generated hash. The compute node further tracks the wildcard entry in the posted receive data structure in response to determining the plurality of parameters includes the wildcard entry and appends the receive entry to a wildcard list of the posted receive data structure in response to tracking the wildcard entry.

    TECHNOLOGIES FOR SCALABLE REMOTELY ACCESSIBLE MEMORY SEGMENTS
    4.
    发明申请
    TECHNOLOGIES FOR SCALABLE REMOTELY ACCESSIBLE MEMORY SEGMENTS 审中-公开
    可扩展的远程可用内存部分技术

    公开(公告)号:US20160314073A1

    公开(公告)日:2016-10-27

    申请号:US14696719

    申请日:2015-04-27

    CPC classification number: G06F15/17331 G06F3/061 G06F3/0659 G06F3/067

    Abstract: Technologies for one-side remote memory access communication include multiple computing nodes in communication over a network. A receiver computing node receives a message from a sender node and extracts a segment identifier from the message. The receiver computing node determines, based on the segment identifier, a segment start address associated with a partitioned global address space (PGAS) segment of its local memory. The receiver computing node may index a segment table stored in the local memory or in a host fabric interface. The receiver computing node determines a local destination address within the PGAS segment based on the segment start address and an offset included in the message. The receiver computing node performs a remote memory access operation at the local destination address. The receiver computing node may perform those operations in hardware by the host fabric interface of the receiver computing node. Other embodiments are described and claimed.

    Abstract translation: 用于单侧远程存储器访问通信的技术包括通过网络进行通信的多个计算节点。 接收机计算节点从发送方节点接收消息,并从消息中提取分段标识符。 接收机计算节点基于段标识符确定与其本地存储器的分区全局地址空间(PGAS)段相关联的段起始地址。 接收机计算节点可以索引存储在本地存储器或主机结构接口中的段表。 接收机计算节点基于段起始地址和消息中包含的偏移来确定PGAS段内的本地目的地址。 接收机计算节点在本地目的地地址执行远程存储器访问操作。 接收机计算节点可以由接收机计算节点的主机结构接口在硬件中执行这些操作。 描述和要求保护其他实施例。

    Isolating communication streams to achieve high performance multi-threaded communication for global address space programs

    公开(公告)号:US11989599B2

    公开(公告)日:2024-05-21

    申请号:US16880277

    申请日:2020-05-21

    CPC classification number: G06F9/544 G06F9/52

    Abstract: Systems, apparatuses and methods may provide for detecting an outbound communication and identifying a context of the outbound communication. Additionally, a completion status of the outbound communication may be tracked relative to the context. In one example, tracking the completion status includes incrementing a sent messages counter associated with the context in response to the outbound communication, detecting an acknowledgement of the outbound communication based on a network response to the outbound communication, incrementing a received acknowledgements counter associated with the context in response to the acknowledgement, comparing the sent messages counter to the received acknowledgements counter, and triggering a per-context memory ordering operation if the sent messages counter and the received acknowledgements counter have matching values.

    METHODS AND APPARATUS TO IMPROVE PERFORMANCE DATA COLLECTION OF A HIGH PERFORMANCE COMPUTING APPLICATION

    公开(公告)号:US20220334948A1

    公开(公告)日:2022-10-20

    申请号:US17856148

    申请日:2022-07-01

    Abstract: Methods, apparatus, systems and articles of manufacture to improve performance data collection are disclosed. An example apparatus includes a performance data comparator of a source node to collect the performance data of an application of the source node from the host fabric interface at a polling frequency; an interface to transmit a write back instruction to the host fabric interface, the write back instruction to cause data to be written to a memory address location of memory of the source node to trigger a wake up mode; and a frequency selector to: start the polling frequency to a first polling frequency for a sleep mode; and increase the polling frequency to a second polling frequency in response to the data in the memory address location identifying the wake mode.

    Technologies for fine-grained completion tracking of memory buffer accesses

    公开(公告)号:US10963183B2

    公开(公告)日:2021-03-30

    申请号:US15463005

    申请日:2017-03-20

    Abstract: Technologies for fine-grained completion tracking of memory buffer accesses include a compute device. The compute device is to establish multiple counter pairs for a memory buffer. Each counter pair includes a locally managed offset and a completion counter. The compute device is also to receive a request from a remote compute device to access the memory buffer, assign one of the counter pairs to the request, advance the locally managed offset of the assigned counter pair by the amount of data to be read or written, and advance the completion counter of the assigned counter pair as the data is read from or written to the memory buffer. Other embodiments are also described and claimed.

    Technologies for sideband performance tracing of network traffic

    公开(公告)号:US10135711B2

    公开(公告)日:2018-11-20

    申请号:US14979140

    申请日:2015-12-22

    Abstract: Technologies for tracing network performance include a network computing device configured to receive a network packet from a source endpoint node, process the received network packet, capture trace data corresponding to the network packet as it is processed by the network computing device, and transmit the received network packet to a target endpoint node. The network computing device is further configured to generate a trace data network packet that includes at least a portion of the captured trace data and transmit the trace data network packet to the destination endpoint node. The destination endpoint node is configured to monitor performance of the network by reconstructing a trace of the network packet based on the trace data of the trace data network packet. Other embodiments are described herein.

Patent Agency Ranking