TUNING OF LOOP ORDERS IN BLOCKED DENSE BASIC LINEAR ALGEBRA SUBROUTINES

    公开(公告)号:US20190318225A1

    公开(公告)日:2019-10-17

    申请号:US16454318

    申请日:2019-06-27

    Abstract: An example includes a sequence generator to generate a plurality of sequence pairs, a first one of the sequence pairs including: (i) a first input sequence representing first accesses to first tensors in a first loop nest of a first computer program, and (ii) a first output sequence representing a first tuned loop nest corresponding to the first accesses to the first tensors in the first loop nest; a model trainer to train a recurrent neural network based on the sequence pairs as training data, the recurrent neural network to be trained to tune loop ordering of a second computer program based on a second input sequence representing second accesses to a second tensor in a second loop nest of the second computer program; and a memory interface to store, in memory, a trained model corresponding to the recurrent neural network.

    METHODS AND APPARATUS TO OPTIMIZE WORKFLOWS
    12.
    发明申请

    公开(公告)号:US20190317805A1

    公开(公告)日:2019-10-17

    申请号:US16451974

    申请日:2019-06-25

    Abstract: Methods, apparatus, systems and articles of manufacture are disclosed that optimize workflows. An example apparatus includes an intent determiner to determine an objective of a user input, the objective indicating a task to be executed in an infrastructure, a configuration composer to compose a plurality of workflows based on the determined objective, a model executor to execute a machine learning model to create a confidence score relating to the plurality of workflows, and a workflow selector to select at least one of the plurality of workflows for execution in the infrastructure, the selection of the at least one of the plurality of workflows based on the confidence score.

    METHODS AND SYSTEMS TO IDENTIFY AND REPRODUCE CONCURRENCY VIOLATIONS IN MULTI-THREADED PROGRAMS USING EXPRESSIONS
    15.
    发明申请
    METHODS AND SYSTEMS TO IDENTIFY AND REPRODUCE CONCURRENCY VIOLATIONS IN MULTI-THREADED PROGRAMS USING EXPRESSIONS 审中-公开
    使用表达法识别并复制多个程序中的同时违反的方法和系统

    公开(公告)号:US20150363306A1

    公开(公告)日:2015-12-17

    申请号:US14836103

    申请日:2015-08-26

    Abstract: Methods and systems to identify threads responsible for causing a concurrency bug in a computer program having a plurality of concurrently executing threads are disclosed. An example method disclosed herein includes defining, with a processor, a data type. The data type including a first predicate, the first predicate being invoked using a first program instruction inserted in a first thread of the plurality of threads, a second predicate, the second predicate being invoked using a second program instruction inserted in a second thread of the plurality of threads, and an expression defining a relationship between the first predicate and the second predicate. The method further includes, in response to determining the relationship is satisfied during execution of the computer program, identifying the first thread and the second thread as responsible for the concurrency bug.

    Abstract translation: 公开了用于识别负责在具有多个同时执行的线程的计算机程序中引起并发错误的线程的方法和系统。 本文公开的示例性方法包括使用处理器来定义数据类型。 所述数据类型包括第一谓词,使用插入在所述多个线程的第一线程中的第一程序指令来调用所述第一谓词,第二谓词,所述第二谓词使用插入到所述第一线索的第二线程中的第二程序指令来调用 多个线程,以及定义第一谓词和第二谓词之间的关系的表达式。 该方法还包括响应于确定在执行计算机程序期间的关系得到满足,识别第一线程和第二线程,以负责并发错误。

    Methods and apparatus to optimize workflows

    公开(公告)号:US11656903B2

    公开(公告)日:2023-05-23

    申请号:US16451974

    申请日:2019-06-25

    CPC classification number: G06F9/4881 G06N20/00

    Abstract: Methods, apparatus, systems and articles of manufacture are disclosed that optimize workflows. An example apparatus includes an intent determiner to determine an objective of a user input, the objective indicating a task to be executed in an infrastructure, a configuration composer to compose a plurality of workflows based on the determined objective, a model executor to execute a machine learning model to create a confidence score relating to the plurality of workflows, and a workflow selector to select at least one of the plurality of workflows for execution in the infrastructure, the selection of the at least one of the plurality of workflows based on the confidence score.

    Methods and apparatus to optimize execution of a machine learning model

    公开(公告)号:US11507838B2

    公开(公告)日:2022-11-22

    申请号:US16456863

    申请日:2019-06-28

    Abstract: Methods, apparatus, systems and articles of manufacture to optimize execution of a machine learning model are disclosed. An example apparatus includes a quantizer to quantize a layer of a model based on an execution constraint, the layer of the model represented by a matrix. A packer is to pack the quantized layer of the matrix to create a packed layer represented by a packed matrix, the packed matrix having non-zero values of the matrix grouped together along at least one of a row or a column of the matrix. A blocker is to block the packed layer into a blocked layer by dividing the non-zero values in the packed matrix into blocks. A fuser is to fuse the blocked layer into a pipeline. A packager is to package the pipeline into a binary.

Patent Agency Ranking