SYSTEMS AND METHODS FOR DATA-AWARE STORAGE TIERING FOR DEEP LEARNING

    公开(公告)号:US20220327376A1

    公开(公告)日:2022-10-13

    申请号:US17226917

    申请日:2021-04-09

    Abstract: Systems and methods are configured to split an epoch associated with a training dataset into a plurality of mini-epochs. A machine learning model can be trained with a mini-epoch of the plurality of mini-epochs. The mini-epoch can be, during the training, iterated for a number of times during the training. One or more metrics reflective of at least one of: a training loss, training accuracy, or validation accuracy of the machine learning model associated with the mini-epoch can be received. Whether to terminate iterations of the mini-epoch early before a number of iterations of the mini-epoch reaches the number of times based on the one or more metrics can be determined. The number of iterations can be a non-zero number.

    Associating insights with data
    24.
    发明授权

    公开(公告)号:US10936637B2

    公开(公告)日:2021-03-02

    申请号:US15483498

    申请日:2017-04-10

    Abstract: Some examples relate to associating an insight with data. In an example, data may be received. A determination may be made that data type of the data is same as compared to an earlier data. An insight generated from the earlier data may be identified, wherein the insight may represent intermediate or resultant data generated upon processing of the earlier data by an analytics function, and wherein during generation metadata is associated with the insight. An analytics function used for generating the insight may be identified.

    DATA TRANSFER USING SNAPSHOT DIFFERENCING FROM EDGE SYSTEM TO CORE SYSTEM

    公开(公告)号:US20200301881A1

    公开(公告)日:2020-09-24

    申请号:US16361310

    申请日:2019-03-22

    Abstract: A source system generates snapshots of collected data. The snapshots have respective associated time references. Responsive to a request from a target system for data collected over a time interval, the source system generates a subset of the data collected by determining a start snapshot and an end snapshot. The start snapshot and the end snapshot are determined as a pair of snapshots that have respective associated time references that are most closely spaced and are inclusive of the time interval. The source system determines a difference in the data included in the end snapshot and the start snapshot and provides the subset of the data as the difference in the data included in the end snapshot and the start snapshot.

    Data provisioning for an analytical process based on lineage metadata

    公开(公告)号:US20180096080A1

    公开(公告)日:2018-04-05

    申请号:US15281254

    申请日:2016-09-30

    CPC classification number: G06F16/254

    Abstract: Examples disclosed herein relate to data provisioning for an analytical process based on lineage metadata. In an example, a value of a parameter related to execution of an analytical process on a remote hub device may be determined based on lineage metadata stored on an edge device, wherein the analytical process is part of an analytical workflow that may be implemented at least in part on the edge device and the remote hub device. In response to a determination that the value of the parameter is above a predefined threshold, the edge device may provide to the remote hub device, input data for a future execution of the analytical process in advance of execution of the analytical process on the remote hub device without a request for the input data by the remote hub device.

    ARTIFICIAL INTELLIGENCE OPTIMIZATION PLATFORM

    公开(公告)号:US20220230024A1

    公开(公告)日:2022-07-21

    申请号:US17153852

    申请日:2021-01-20

    Abstract: Systems and methods are provided for reusing machine learning models. For example, the applicability of prior models may be compared using one or more assessment values, including a similarity threshold and/or an accuracy threshold. The similarity threshold may identify a similarity of data between a first data set used to generate a first model and a new data set that is received by the system. When the similarity between these two data sets is exceeded, the system may reuse a model with the highest similarity value. When an accuracy value of the data set does not exceed an accuracy threshold, the system may initiate a retraining process to generate a second ML model associated with the second data.

Patent Agency Ranking