SYSTEMS AND METHODS FOR PERFORMING INSTRUCTIONS TO TRANSFORM MATRICES INTO ROW-INTERLEAVED FORMAT

    公开(公告)号:US20220357950A1

    公开(公告)日:2022-11-10

    申请号:US17865849

    申请日:2022-07-15

    Abstract: Disclosed embodiments relate to systems and methods for performing instructions to transform matrices into a row-interleaved format. In one example, a processor includes fetch and decode circuitry to fetch and decode an instruction having fields to specify an opcode and locations of source and destination matrices, wherein the opcode indicates that the processor is to transform the specified source matrix into the specified destination matrix having the row-interleaved format; and execution circuitry to respond to the decoded instruction by transforming the specified source matrix into the specified RowInt-formatted destination matrix by interleaving J elements of each J-element sub-column of the specified source matrix in either row-major or column-major order into a K-wide submatrix of the specified destination matrix, the K-wide submatrix having K columns and enough rows to hold the J elements.

    SYSTEMS AND METHODS FOR IMPLEMENTING CHAINED TILE OPERATIONS

    公开(公告)号:US20190303167A1

    公开(公告)日:2019-10-03

    申请号:US15942201

    申请日:2018-03-30

    Abstract: Disclosed embodiments relate to systems and methods for implementing chained tile operations. In one example, a processor includes fetch circuitry to fetch one or more instructions until a plurality of instructions has been fetched, each instruction to specify source and destination tile operands, decode circuitry to decode the fetched instructions, and execution circuitry, responsive to the decoded instructions, to: identify first and second decoded instructions belonging to a chain of instructions, dynamically select and configure a SIMD path comprising first and second processing engines (PE) to execute the first and second decoded instructions, and set aside the specified destination of the first decoded instruction, and instead route a result of the first decoded instruction from the first PE to be used by the second PE to perform the second decoded instruction.

    SYSTEMS, APPARATUSES, AND METHODS FOR DATA SPECULATION EXECUTION

    公开(公告)号:US20190121644A1

    公开(公告)日:2019-04-25

    申请号:US14582859

    申请日:2014-12-24

    Abstract: Systems, methods, and apparatuses for data speculation execution (DSX) are described. In some embodiments, a hardware apparatus for DSX comprises execution hardware to execute instructions to begin and end a data speculative execution (DSX) and speculative instructions during the DSX, and DSX tracking hardware to track speculative memory accesses and detect ordering violations in a DSX of speculative instructions using a sequence number, addresses of instruction accesses, and whether an instruction being tracked is a write, and to trigger a mis-speculation upon an ordering violation.

    SYSTEMS AND METHODS FOR PERFORMING INSTRUCTIONS SPECIFYING TERNARY TILE LOGIC OPERATIONS

    公开(公告)号:US20190042260A1

    公开(公告)日:2019-02-07

    申请号:US16131376

    申请日:2018-09-14

    Abstract: Disclosed embodiments relate to systems and methods for performing instructions specifying ternary tile operations. In one example, a processor includes fetch and decode circuitry to fetch and decode an instruction specifying a ternary tile operation, and locations of destination and first, second, and third source matrices, each of the matrices having M rows by N columns; and execution circuitry to respond to the decoded instruction by, for each equal-sized group of K elements of the specified first, second, and third source matrices, generate K results by performing the ternary tile operation in parallel on K corresponding elements of the specified first, second, and third source matrices, and store each of the K results to a corresponding element of the specified destination matrix, wherein corresponding elements of the specified source and destination matrices occupy a same relative position within their associated matrix.

    APPARATUS AND METHOD FOR PARTITIONED SHUFFLES

    公开(公告)号:US20250103337A1

    公开(公告)日:2025-03-27

    申请号:US18373900

    申请日:2023-09-27

    Abstract: An apparatus and method for partitioned shuffling of data elements. A first partition is associated with a first number of source data elements corresponding to a first plurality of lanes having a first plurality of lane identifiers (IDs) and a second partition is associated with a second number of source data elements corresponding to a second plurality of lanes having a second plurality of lane IDs. A bounded offset vector is generated based on allowable ranges for a plurality of offset values associated with the source data elements. An index vector is generated by permuting the first and second plurality of lane IDs in accordance with the bounded offset vector.

Patent Agency Ranking