Automated message-based job flow resource management in container-supported many task computing

    公开(公告)号:US11144293B2

    公开(公告)日:2021-10-12

    申请号:US17139364

    申请日:2020-12-31

    Abstract: An apparatus includes at least one processor to retrieve a job flow definition defining a job flow as a set of tasks and dependencies thereamong, store a job performance request message to perform the job flow within a job queue, and in response to the storage of the job performance request message, execute instructions of a performance routine within a storage container to: based on the dependencies, derive an order of performance of the set of tasks that specifies a first task to perform; store, within a task queue, a first task routine execution request message requesting execution of a first task routine; and provide, to a resource allocation routine, an indication of a need for a first task container in which to execute the first task routine to perform the first task, wherein execution of the resource allocation routine causes dynamic allocation of containers based on availability of resources.

    COMPUTERIZED PIPELINES FOR TRANSFORMING INPUT DATA INTO DATA STRUCTURES COMPATIBLE WITH MODELS

    公开(公告)号:US20210263949A1

    公开(公告)日:2021-08-26

    申请号:US17173308

    申请日:2021-02-11

    Abstract: Computerized pipelines can transform input data into data structures compatible with models in some examples. In one such example, a system can obtain a first table that includes first data referencing a set of subjects. The system can then execute a sequence of processing operations on the first data in a particular order defined by a data-processing pipeline to modify an analysis table to include features associated with the set of subjects. Executing each respective processing operation in the sequence to generate the modified analysis table may involve: deriving a respective set of features from the first data by executing a respective feature-extraction operation on the first data; and adding the respective set of features to the analysis table. The system may then execute a predictive model on the modified analysis table for generating a predicted value based on the modified analysis table.

    Distributable event prediction and machine learning recognition system

    公开(公告)号:US11100428B2

    公开(公告)日:2021-08-24

    申请号:US16706912

    申请日:2019-12-09

    Inventor: Xu Chen

    Abstract: A computing device predicts occurrence of an event or classifies an object using distributed unlabeled data. A Laplacian matrix is computed using a kernel function. A predefined number of eigenvectors is selected from a decomposed Laplacian matrix to define a decomposition matrix. A gradient value is computed as a function of the defined decomposition matrix, a plurality of sparse coefficients, and a label matrix, a value of each coefficient of the plurality of sparse coefficients is updated based on the computed gradient value, and the computations are repeated until a convergence parameter value indicates the plurality of sparse coefficients have converged. A classification matrix is defined using the plurality of sparse coefficients to determine the target variable value for each observation vector of the plurality of unclassified observation vectors. The target variable value for each observation vector of the plurality of unclassified observation vectors is output.

    Distributable feature analysis and tree model training system

    公开(公告)号:US11093864B1

    公开(公告)日:2021-08-17

    申请号:US17093826

    申请日:2020-11-10

    Abstract: A computing system computes a variable relevance using a trained tree model. (A) A next child node is selected. (B) A number of observations associated with the next child node is computed. (C) A population ratio value is computed. (D) A next leaf node is selected. (E) First observations are identified. (F) A first impurity value is computed for the first observations. (G) Second observations are identified when the first observations are associated with the descending child nodes. (H) A second impurity value is computed for the second observations. (I) A gain contribution is computed. (J) A node gain value is updated. (K) (D) through (J) are repeated. (L) A variable gain value is updated for a variable associated with the split test. (M) (A) through (L) are repeated. (N) A set of relevant variables is selected based on the variable gain value.

    Machine learning classification system

    公开(公告)号:US11087215B1

    公开(公告)日:2021-08-10

    申请号:US17224708

    申请日:2021-04-07

    Inventor: Xu Chen

    Abstract: A computing device classifies unclassified observations. A first batch of noise observations is generated. (A) A first batch of unclassified observations is selected. (B) A first batch of classified observations is selected. (C) A discriminator neural network model trained to classify unclassified observations and noise observations is updated with observations that include the first batch of unclassified observations, the first batch of classified observations, and the first batch of noise observations. (D) A discriminator loss value is computed that includes an adversarial loss term computed using a predefined transition matrix. (E) A second batch of unclassified observations is selected. (F) A second batch of noise observations is generated. (G) A generator neural network model trained to generate a fake observation vector for the second batch of noise observations is updated with the second batch of unclassified observations and the second batch of noise observations. (H) (A) to (G) is repeated.

    Techniques for extracting contextually structured data from document images

    公开(公告)号:US11087077B2

    公开(公告)日:2021-08-10

    申请号:US17089962

    申请日:2020-11-05

    Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation of a document may be used to generate a template for the template database.

    Tool for design experiments with uncontrolled factors

    公开(公告)号:US11087033B1

    公开(公告)日:2021-08-10

    申请号:US17153731

    申请日:2021-01-20

    Abstract: A computing system generates a subset of design cases of candidate design cases. The system indexes, in the subset, data elements. The system generates a design of an experiment by, for each respective data element, determining a status indicating whether the respective data element corresponds to an uncontrolled factor or a controlled factor. When the status indicates the uncontrolled factor, the system determines if substituting a respective set of specified options of a respective candidate design case comprising the respective data element with a different set of specified options of the candidate design cases improves a criterion measure according to a design criterion. When the status indicates the controlled factor, the system determines if changing an assigned option of the respective data element improves the criterion measure. The system updates the criterion measure with an updated criterion measure according to a change of the subset based on generating the design.

    Machine learning classification system

    公开(公告)号:US11074412B1

    公开(公告)日:2021-07-27

    申请号:US17202413

    申请日:2021-03-16

    Abstract: A system trains a classification model. Text windows are defined from tokens based on a window size. A network model including a transformer network is trained with the text windows to define classification information. A first accuracy value is computed. (A) The window size is reduced using a predefined reduction factor value. (B) Second text windows are defined based on the reduced window size. (C) Retrain the network model with the second text windows to define classification information. (D) A second accuracy value is computed. (E) An accuracy reduction value is computed from the second accuracy value relative to the first accuracy value. When the computed accuracy reduction value is ≥an accuracy reduction tolerance value, repeat (A)-(E) until the accuracy reduction value is

Patent Agency Ranking