-
41.
公开(公告)号:US11144293B2
公开(公告)日:2021-10-12
申请号:US17139364
申请日:2020-12-31
Applicant: SAS Institute Inc.
Inventor: Henry Gabriel Victor Bequet , Ronald Earl Stogner
IPC: G06F8/51 , G06F9/46 , G06N3/08 , G06F16/901 , G06F16/903 , H04L29/08
Abstract: An apparatus includes at least one processor to retrieve a job flow definition defining a job flow as a set of tasks and dependencies thereamong, store a job performance request message to perform the job flow within a job queue, and in response to the storage of the job performance request message, execute instructions of a performance routine within a storage container to: based on the dependencies, derive an order of performance of the set of tasks that specifies a first task to perform; store, within a task queue, a first task routine execution request message requesting execution of a first task routine; and provide, to a resource allocation routine, an indication of a need for a first task container in which to execute the first task routine to perform the first task, wherein execution of the resource allocation routine causes dynamic allocation of containers based on availability of resources.
-
42.
公开(公告)号:US11137990B2
公开(公告)日:2021-10-05
申请号:US17139546
申请日:2020-12-31
Applicant: SAS Institute Inc.
Inventor: Henry Gabriel Victor Bequet , Eric Jian Yang , Qing Gong , Kais Arfaoui , Ronald Earl Stogner , Partha Dutta
IPC: G06F9/46 , G06F8/51 , G06N3/08 , G06F16/901 , G06F16/903 , H04L29/08
Abstract: An apparatus includes at least on processor to: parse a job flow definition for a job flow to identify an implicit expression of a data dependency arising from a data object output by a first task of the job flow and input to a second task thereof, wherein the first task is referred to as a data object required by the second as an input; in response to identifying the implicit expression, derive an order of performance of the tasks of the job flow that includes performing the first task before the second to ensure generation of the data object prior to performance of the second; for each task, retrieve a corresponding task routine; execute the task routines in an order that follows the order of performance of the tasks; and transmit, to the requesting device via the network, an indication of successful performance of the job flow.
-
公开(公告)号:US20210294568A1
公开(公告)日:2021-09-23
申请号:US17001195
申请日:2020-08-24
Applicant: SAS Institute Inc.
Inventor: Claude Edward Summers, II , Sean Patrick Mealin , Julianna Elizabeth Langston , Gregory David Kraus , Jonathan Tyler Williamson , Lisa Beth Morton Robinson , Jesse Daniel Sookne , Brice Joseph Smith
IPC: G06F3/16 , G06F3/0481 , G10L13/00 , H04S7/00 , G06F3/01
Abstract: Geospatial data can be converted into audio outputs. For example, a system can receive a dataset indicating geospatial locations of objects within a region. Based on the dataset, the system can generate a virtual map representing the region and including virtual points representing the objects. The virtual points can be spatially positioned at locations in the virtual map corresponding to the geospatial locations of the objects in the region. The system can receive a user input via a user input device for interacting with a particular virtual point among the virtual points in the virtual map. The system can determine one or more sound characteristics for a sound based on receiving the user input. The system can then transmit an audio signal to an audio device for causing the audio device to generate the sound having the one or more sound characteristics, which may assist with exploring the virtual map.
-
44.
公开(公告)号:US20210263949A1
公开(公告)日:2021-08-26
申请号:US17173308
申请日:2021-02-11
Applicant: SAS Institute Inc.
Inventor: James Allen Cox , Nancy Anne Rausch
Abstract: Computerized pipelines can transform input data into data structures compatible with models in some examples. In one such example, a system can obtain a first table that includes first data referencing a set of subjects. The system can then execute a sequence of processing operations on the first data in a particular order defined by a data-processing pipeline to modify an analysis table to include features associated with the set of subjects. Executing each respective processing operation in the sequence to generate the modified analysis table may involve: deriving a respective set of features from the first data by executing a respective feature-extraction operation on the first data; and adding the respective set of features to the analysis table. The system may then execute a predictive model on the modified analysis table for generating a predicted value based on the modified analysis table.
-
公开(公告)号:US11100428B2
公开(公告)日:2021-08-24
申请号:US16706912
申请日:2019-12-09
Applicant: SAS Institute Inc.
Inventor: Xu Chen
Abstract: A computing device predicts occurrence of an event or classifies an object using distributed unlabeled data. A Laplacian matrix is computed using a kernel function. A predefined number of eigenvectors is selected from a decomposed Laplacian matrix to define a decomposition matrix. A gradient value is computed as a function of the defined decomposition matrix, a plurality of sparse coefficients, and a label matrix, a value of each coefficient of the plurality of sparse coefficients is updated based on the computed gradient value, and the computations are repeated until a convergence parameter value indicates the plurality of sparse coefficients have converged. A classification matrix is defined using the plurality of sparse coefficients to determine the target variable value for each observation vector of the plurality of unclassified observation vectors. The target variable value for each observation vector of the plurality of unclassified observation vectors is output.
-
公开(公告)号:US11093864B1
公开(公告)日:2021-08-17
申请号:US17093826
申请日:2020-11-10
Applicant: SAS Institute Inc.
Inventor: Brandon Michael Reese
Abstract: A computing system computes a variable relevance using a trained tree model. (A) A next child node is selected. (B) A number of observations associated with the next child node is computed. (C) A population ratio value is computed. (D) A next leaf node is selected. (E) First observations are identified. (F) A first impurity value is computed for the first observations. (G) Second observations are identified when the first observations are associated with the descending child nodes. (H) A second impurity value is computed for the second observations. (I) A gain contribution is computed. (J) A node gain value is updated. (K) (D) through (J) are repeated. (L) A variable gain value is updated for a variable associated with the split test. (M) (A) through (L) are repeated. (N) A set of relevant variables is selected based on the variable gain value.
-
公开(公告)号:US11087215B1
公开(公告)日:2021-08-10
申请号:US17224708
申请日:2021-04-07
Applicant: SAS Institute Inc.
Inventor: Xu Chen
Abstract: A computing device classifies unclassified observations. A first batch of noise observations is generated. (A) A first batch of unclassified observations is selected. (B) A first batch of classified observations is selected. (C) A discriminator neural network model trained to classify unclassified observations and noise observations is updated with observations that include the first batch of unclassified observations, the first batch of classified observations, and the first batch of noise observations. (D) A discriminator loss value is computed that includes an adversarial loss term computed using a predefined transition matrix. (E) A second batch of unclassified observations is selected. (F) A second batch of noise observations is generated. (G) A generator neural network model trained to generate a fake observation vector for the second batch of noise observations is updated with the second batch of unclassified observations and the second batch of noise observations. (H) (A) to (G) is repeated.
-
公开(公告)号:US11087077B2
公开(公告)日:2021-08-10
申请号:US17089962
申请日:2020-11-05
Applicant: SAS Institute Inc.
IPC: G06K9/00 , G06F40/169 , G06F16/93 , G06F40/284 , G06F40/186 , G06K9/46 , G06F3/0484
Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation of a document may be used to generate a template for the template database.
-
公开(公告)号:US11087033B1
公开(公告)日:2021-08-10
申请号:US17153731
申请日:2021-01-20
Applicant: SAS Institute Inc.
Inventor: Ryan Adam Lekivetz , Bradley Allen Jones , Joseph Albert Morgan , Caleb Bridges King
IPC: G06F30/12 , G06F17/18 , G06F30/27 , G06F111/20
Abstract: A computing system generates a subset of design cases of candidate design cases. The system indexes, in the subset, data elements. The system generates a design of an experiment by, for each respective data element, determining a status indicating whether the respective data element corresponds to an uncontrolled factor or a controlled factor. When the status indicates the uncontrolled factor, the system determines if substituting a respective set of specified options of a respective candidate design case comprising the respective data element with a different set of specified options of the candidate design cases improves a criterion measure according to a design criterion. When the status indicates the controlled factor, the system determines if changing an assigned option of the respective data element improves the criterion measure. The system updates the criterion measure with an updated criterion measure according to a change of the subset based on generating the design.
-
公开(公告)号:US11074412B1
公开(公告)日:2021-07-27
申请号:US17202413
申请日:2021-03-16
Applicant: SAS Institute Inc.
IPC: G06F40/30 , G06N20/00 , G06F40/284 , G06N3/04 , G06N3/08
Abstract: A system trains a classification model. Text windows are defined from tokens based on a window size. A network model including a transformer network is trained with the text windows to define classification information. A first accuracy value is computed. (A) The window size is reduced using a predefined reduction factor value. (B) Second text windows are defined based on the reduced window size. (C) Retrain the network model with the second text windows to define classification information. (D) A second accuracy value is computed. (E) An accuracy reduction value is computed from the second accuracy value relative to the first accuracy value. When the computed accuracy reduction value is ≥an accuracy reduction tolerance value, repeat (A)-(E) until the accuracy reduction value is
-
-
-
-
-
-
-
-
-