Matrix operation optimization mechanism

    公开(公告)号:US12039000B2

    公开(公告)日:2024-07-16

    申请号:US18163418

    申请日:2023-02-02

    CPC classification number: G06F17/16 G06F7/78 G06N3/044 G06N3/084

    Abstract: An apparatus to facilitate machine learning matrix processing is disclosed. The apparatus comprises a memory to store matrix data one or more processors to execute an instruction to examine a message descriptor included in the instruction to determine a type of matrix layout manipulation operation that is to be executed, examine a message header included in the instruction having a plurality of parameters that define a two-dimensional (2D) memory surface that is to be retrieved, retrieve one or more blocks of the matrix data from the memory based on the plurality of parameters and a register file including a plurality of registers, wherein the one or more blocks of the matrix data is stored within a first set of the plurality of registers.

    MATRIX OPERATION OPTIMIZATION MECHANISM
    2.
    发明公开

    公开(公告)号:US20230289399A1

    公开(公告)日:2023-09-14

    申请号:US18163418

    申请日:2023-02-02

    CPC classification number: G06F17/16 G06F7/78 G06N3/084 G06N3/044

    Abstract: An apparatus to facilitate machine learning matrix processing is disclosed. The apparatus comprises a memory to store matrix data one or more processors to execute an instruction to examine a message descriptor included in the instruction to determine a type of matrix layout manipulation operation that is to be executed, examine a message header included in the instruction having a plurality of parameters that define a two-dimensional (2D) memory surface that is to be retrieved, retrieve one or more blocks of the matrix data from the memory based on the plurality of parameters and a register file including a plurality of registers, wherein the one or more blocks of the matrix data is stored within a first set of the plurality of registers.

    Abstraction layers for scalable distributed machine learning

    公开(公告)号:US11094029B2

    公开(公告)日:2021-08-17

    申请号:US15482953

    申请日:2017-04-10

    Abstract: One embodiment provides for a method of transmitting data between multiple compute nodes of a distributed compute system, the method comprising creating a global view of communication operations to be performed between the multiple compute nodes of the distributed compute system, the global view created using information specific to a machine learning model associated with the distributed compute system; using the global view to determine a communication cost of the communication operations; and automatically determining a number of network endpoints for use in transmitting the data between the multiple compute nodes of the distributed compute system.

    Matrix operation optimization mechanism

    公开(公告)号:US11593454B2

    公开(公告)日:2023-02-28

    申请号:US16890122

    申请日:2020-06-02

    Abstract: An apparatus to facilitate machine learning matrix processing is disclosed. The apparatus comprises a memory to store matrix data one or more processors to execute an instruction to examine a message descriptor included in the instruction to determine a type of matrix layout manipulation operation that is to be executed, examine a message header included in the instruction having a plurality of parameters that define a two-dimensional (2D) memory surface that is to be retrieved, retrieve one or more blocks of the matrix data from the memory based on the plurality of parameters and a register file including a plurality of registers, wherein the one or more blocks of the matrix data is stored within a first set of the plurality of registers.

    MATRIX OPERATION OPTIMIZATION MECHANISM

    公开(公告)号:US20210374209A1

    公开(公告)日:2021-12-02

    申请号:US16890122

    申请日:2020-06-02

    Abstract: An apparatus to facilitate machine learning matrix processing is disclosed. The apparatus comprises a memory to store matrix data one or more processors to execute an instruction to examine a message descriptor included in the instruction to determine a type of matrix layout manipulation operation that is to be executed, examine a message header included in the instruction having a plurality of parameters that define a two-dimensional (2D) memory surface that is to be retrieved, retrieve one or more blocks of the matrix data from the memory based on the plurality of parameters and a register file including a plurality of registers, wherein the one or more blocks of the matrix data is stored within a first set of the plurality of registers.

    MATRIX OPERATION OPTIMIZATION MECHANISM

    公开(公告)号:US20240427842A1

    公开(公告)日:2024-12-26

    申请号:US18674212

    申请日:2024-05-24

    Abstract: An apparatus to facilitate machine learning matrix processing is disclosed. The apparatus comprises a memory to store matrix data one or more processors to execute an instruction to examine a message descriptor included in the instruction to determine a type of matrix layout manipulation operation that is to be executed, examine a message header included in the instruction having a plurality of parameters that define a two-dimensional (2D) memory surface that is to be retrieved, retrieve one or more blocks of the matrix data from the memory based on the plurality of parameters and a register file including a plurality of registers, wherein the one or more blocks of the matrix data is stored within a first set of the plurality of registers.

Patent Agency Ranking