PACKED 16 BITS INSTRUCTION PIPELINE
    1.
    发明申请

    公开(公告)号:US20190129718A1

    公开(公告)日:2019-05-02

    申请号:US15799560

    申请日:2017-10-31

    Abstract: Systems, apparatuses, and methods for routing traffic between clients and system memory are disclosed. A computing system includes a processor capable of executing single precision mathematical instructions on data sizes of M bits and half precision mathematical instructions on data sizes of N bits, which is less than M bits. At least two source operands with M bits indicated by a received instruction are read from a register file. If the instruction is a packed math instruction, at least a first source operand with a size of N bits less than M bits is selected from either a high portion or a low portion of one of the at least two source operands read from the register file. The instruction includes fields storing bits, each bit indicating the high portion or the low portion of a given source operand associated with a register identifier specified elsewhere in the instruction.

    Flexibly deriving intended thread data exchange patterns

    公开(公告)号:US10664285B1

    公开(公告)日:2020-05-26

    申请号:US16226411

    申请日:2018-12-19

    Abstract: A method of deriving intended thread data exchange patterns from source code includes identifying, based on a constant array, a pattern of data exchange between a plurality of threads in a wavefront. The constant array includes an array of source lane values identifying a thread location within the wavefront to read from for performing the pattern of data exchange. The pattern of data exchange is identified as a hardware-accelerated exchange pattern based on the constant array.

Patent Agency Ranking